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SEQm 
NO: 


Method 


Predicted 

beginniDg 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A<^Alanine C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=PhenyIalanine, &=Glycine, H^Histidine, 
I=Isoleucine, IC=Ly8ine, Lr=Leucine, MsMethionine, 
N=Aspi^ragine, P«Proline» Q-<»lutamine» R=Arginine, S^Serine, 
T«Threonine, V-Vallne, W^Tryptophan, Y»Tyrosine, 
X>=Unknown, *=Stop codon, A^possible nucleotide deletion, 
\=^ossible nucleotide insertion 










LSRKAVSDMLTACKQASFHPDVSDEVRTRALRF 

GTECTLGYLDLLEHVLVILQKPTPELKQQLAAFS 

KRVAGAVTELIQAAEAMKGTEWVDPEDPTVIAE 

TELLGAAASIEAAAKKLEQLKPRAKPKQADETL 

DFEEOILEAAKSIAAATSALVKSASAAORELVAO 

GKVGSIPANAADDGQWSQGLISAARMVAAATSS 

LCEAANASVQGHASEEKLISSAKQVAASTAQLL 

VACKVKJ\.DQDSEAMRRLQAAGNAVKRASDNL 

VRAAQKAAFGKADDDDVVVKTKFVGGIAQIIAA 

QEEMLKKERELEEARKKLAQIRQQQYKFLPTEL 

REDEG 


3441 


A 


3 


1584 


NSARGGVGVRGARAMATVQEKAAALNLSALHS 

PAHRPPGFSVAQKPFGATYVWSSIINTLQTQVEV 

KKRJUJRLKRHNDCFVGSEAVDVIFSHLIQNKYF 

GDVDIPRAKWRVCQALMDYKVFEAVPTKVFG 

KDKKFTFEDSSCSLYRFmPNQDSQLGKENKLY 

SPARYADALFKSSDIRSASLEDLWENLSLKPANS 

PHVNISTTLSPQVINEVWQEETIGRLLQLVDLPLL 

DSLLKQQEAVPKIPQPKRQSTMVNSSNYLDRGIL 

KAYSDSQEDEWLSAAIDCLEYLPDQMWEISRSF 

VHNGIAELLVNGKTEIALEATQLLLKLLDFQNRE 

EFRRLLYFMAVAANPSEFKLQKESDNRMVVKRI 

FSKAIVDNKNLSKGKTDLLVLFLVMDHQKDVFKI 

PGTL\HKIVS\VK\LMAIQNGRDPNRDAGYIYCQRI 

DQRDYSNITEKTTIDELLYLLKTLDEDSKLSAKE 

KKKNLLGQFYKCHPDIFIEHFGD 


3442 


A 




822 


SPASGHCRl TvJGAAVAMFfirr VAGRI VOTA AOO 

VAEDKFVFDLPDYESINHVWFMLGTIPFPEGMG 

GSVYFSYPDSNGMPVWQLLGFVTNGKPSAIFKIS 

GLKSGEGSQHPFGAMNIVRTPSVAQIGISVELLDS 

MAQQTPVGNAAVSSVDSFTQFTQKMLDNFYNF 

ASSFAVSQA^PDDTQ/RPSEMFIPANWLKWYENF 

QRRTSTEPSLLENIIWIKINF 


3443 


A 


3 

1 

* 


1373 


SWHVRRRWLEATMAGGMKVAVSPAVGPGPWG 

SGVGGGGTVRLLLILSGCLVYGTAETDVNVVML 

QESQVCEKRASQQFCYTNVLIPQWHDIWTRIQIR 

VNSSRLVRVTQVENEEKLKELEQFSIWNFFSSFL 

KEKLNDTYVNVGLYSTKTCLKVEIIEKDTKYSVI 

VIRRFDPKLFLVFLLGLMLFFCGDLLSRSQIFYYS 

TGMTVGIVASULIIIFILSKFMPKKSPIYVILVGGW 

SFSLYLIOLVFKNLOEIWRCYWOYLLSYVLTVGF 

MSFAVCYKYGPLENERSINLLTWTLQLMGLCFM 

YSGIQIPHIALAIIIIALCTKNLEHPIQWLYITCRKV 

CKGAEKPVPPRLLTEEEYRIQGEVETRKALEELR 

EFCNSPDCSAWKTVSRIQSPKRPADFVEGSSHLT 

PNEVSVHEQEYGLGSnAQDEIYEEASSEEEDSYS 

RCPAITQNNFLT 


3444 


A 


566 


1718 


KGLERTCCAMEESDSEKTTEKENLGPRMDPPLG 

EPG\GSLGWVLPNTAMKJCKVLLMGKSGSGKTS 

MRSIIFANYIARDTRRLGATILDRmSLQINSSLST 

YSLVDSVGNTKTFDVEHSHVRPLGNLVLNLWDC 

GGQDTFMENYFTSQRDNIFRNVEVLIYVFDVESR 

ELEKDMHYYQSCLEAILQNSPDAKIFCLVHKMD 

LVQEDQRDLIFKEREEDLRRLSRPLECSCFRTSIW 
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SEQID 
NO: 


Method 


Predicted 

begianing 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteine, D=Aspartic Add, 
£>=Olutaroic Add, F^oPbenylalanine, G^GIycine, H-Histidine, 
I=Isoleucine, K=Lysine, L^Leuciae, M^Methionine, 
N»Asparaglne> P^ProUne, Q=Glutamine, RsArginine^ S=Serine, 
T=Threoninc, V=Vallne, W=Tryptophan, Y=Tyroslne, 
X=Unknown, ^'^^top codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion 










DETLYKAWSSIVYQLIPNVQQLEMNLRNFAEIIE 

ADEVLLFERATFLVISHYQCKEQRDAHRFEKISNI 

IKQFKLSCSKLAASFQSMEVRNSNFAAFIDIFTSN 

TYVlVrVVMSDPSIPSAATLrNmNAJmiFEKLERV 

DGPKQCLLMR 


3445 


A 


566 


1718 


KGLERTCCAMEESDSEKTTEKENLGPRMDPPLG 

EPG\GSLGWVLPNTAMKKKVLLMGKSGSGKTS 

MRSnFAimARDTRRLGATILDRIHSLQINSSLST 

YSLVDSVGNTKTFDVEHSHVRFLGNLVLNLWDC 

GGQDTFMENYFTSQRDNIFRNVEVLIYVFDVESR 

r!>j^jj»jsj^ivixi I I v^o\^L«ij/\JiJL«v<i^ ^rurxr^ir v rjJVLvii^ 

LVQEDQRDLIFKEREEDLRJILSRPLECSCFRTSIW 

DETLYKAWSSIVYQLIPNVQQLEMNLRNFAEIIE 

ADEVLLFERATFLVISHYQCKEQRDAHRFEKISKI 

IKQFKLSCSKLAASFQSMEVRNSNFAAFIDIFTSN 

TYVNfmiSDPSIPSAATLrNIRNARKHFEiaERV 

DGPKQCLLMR 


3446 


A 


566 


1718 


KGLERTCCAMEESDSEKTTEKENLGPRMDPPLG 

EPG\GSLGWVLPNTAMKKKVLLMGKSGSGKTS 

MRSIIFANYL^RDTRRLGATILDRIHSLQINSSLST 

YSLVDSVGNTKTFDVEHSHVRFLGNLVLNLWDC 

GGQDTFMENYFTSQRDNIFRNVEVLIYVFDVESR 

JiL-liisJJJYLrl I Y v^oUL-llAlLl^IN or IJAJWrUJ^VJnLISJYl 

LVQEDQRDLIFKEREEDLRRLSRPLECSCFRTSIW 

DETLYKAWSSIVYQLIPNVQQLEMNLRNFAEIIE 

ADEVLLFERATFLVISHYQCKEQRDAHRFEKISNI 

IKQFKLSCSKLAASFQSMEVRNSNFAAFIDIFTSN 

TYVMWMSDPSIPSAATLINIRNARKHFEKLERV 

DGPKQCLLMR 


3447 


A 


1 


2930 


VLLGPLWDKLSTADHPVIVTMASKRKSTTPCMIP 

VKTVVLQDASMEAQPAETLPEGPQQDLPPEASA 

ASSEAAQNPSSTDGSTLANGHRSTLDGYLYSCK 

YCDFRSHDMTQFVGHMNSEHTDFNKDPTFVCSG 

CSFLAKTPEGLSLHNATCHSGEASFVWNVAKPD 

NHVVVEQSIPESTSTPDLAGEPSAEGADGQAEmT 

KTPIMKJMKGKAEAKKIHTLKENVPSQPVGEALP 

KLSTGEMEVREGDHSFINGAVPVRQASASSAKN 

PHAANGPLIGTVPVLPAGL\QFLSLQQQPPVHAQ 

HHVHQPLPTAKALPKVMIPLSSIPTYSAAMDSNS 

FLKNSFHKFPYPTKAELCYLTVVTKYPEEQLKIW 

FTAQRLKQGISWSPEEIEDARKKMFNTVIQSVPQ 

PTITVLNTPLVASAGNVQHLIQAALPGHVVGQPE 

GTGGGLLVTQPLMANGLQATSSPLPLTVTSVPK 

QPGVAPINTVCSNTTSAVKVVNAAQSLLTACPSI 

TSQAFLDASIYKNKKSHEQLSALKGSFCRNQFPG 

QSEVEHLTKVTGLSTREVRKWFSDRRYHCRNLK 

GSRAMIPGDHRSniDSVPEVSFSPSSKVPEVTCIPT 

TATLATHPSAKRQSWHQTPDFTPTKYKERAPEQ 

LRALESSFAONPLPLDEELDRLRSETKMTRREIDS 

WFSERRKKVNAEETKKAEENASQEEEEAAEDEG 

GEEDLASELRVSGENGSLEMPSSHILAERKVSPIK 

INLKNLRVTEANGRNEIPGLGACDPEDDESNKLA 

EQLPGKVSCKKTAQQRHLLRQLFVQTQWPSNQD 

YDSIMAQTGLPRPEVVRWFGDSRYALKNGQLK 

WYEDYKRGNFPPGLLVIAPGNRELLQDYYMTHK 
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SEQID 
NO: 


Method 


Predicted 

beginiuDg 

nucleotide 

location 

corresponding 

to first aintoo 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue of 
peptide 
sequence 


Amino acid sequence (A^Alanine C°Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F^Phenylalanine, G=GIycine, H=^Histidine, 
I=Iso!eacine, KpLysinCt Lr=Leucine, M=Methionine, 
N-Asparagine, P»Proline, Q=Glutamine, R='Arginine, S=^erine, 
l^Tlireonine, VsVaUnc, W»Tryptopban, Y^Tyrosine, 
XBUnknown, *^top codon, possible nucleotide deletion, 
\-pos$ible nucleotide Insertion 










MLYEEDLQNLCDKTQMSSQQVKQWFAEKMGEE 

TRAVADTGSEDQGPGTGELTAVHKGMGDTYSE 

VSENSESWEPRVPEASSEPFDMSSPQAGRQLETD 


3448 


A 


2 


1324 " 


FVARAEKGFRTREAHLLQVAGVGTGLQNGASLS 

GLASGVMAQRAFPNPYADYNKSLAEGYFDAAG 

RLTPEFSQRLTNKIRELLQQMERGLKSADPRDGT 

GYTGWAGIAVLYLHLYDVFGDPAYLQLAHGYV 

KQSLNCLTKRSITFLCGDAGPLAVAAVLYHKMN 

NEKQAEIX:iTRLIHLNKIDPHAPNEMLYGRIGYIY 

ALLFWKNFGVEKIPQSHIQQICETILTSGENLAR 

KRNFTAKSPLMVFWYOFVYVOAAHGT AOTYYY 

LMQPSLQVSQGKLHSLVKPSVDYVCQLKFPSGN 

YPPCIGDNRDLLVHWCHG APG VIYMLIQA YKVF 

R/EREKYLC\DAYQCADVIWQYGLLKKGYGLCY\ 

GSAGNAYAFLTLYNLTQDMKYLYRACKFAEWC 

LEYGEHGCRTPDTPFSLFBGMAGHYFLVADLLFP 

TKAR\FPAFEL 


3449 


A 


3 


2389 


SRHVTGAARSPSRAGPSDPPAMGDEDDDESCAV 

ELRITEANLTGHEEKVSVENFELLKVLGTGAYGK 

VFLVRKAGGHDAGKLYAMKVLRKAALVQRAK 

TQEHTRTERSVLELVRQAPFLVTLHYAFQTDAKL 

HLBLDYVSGGEMFTHLYQRQYFKEAEVRVYGGE 

IVLALEHLHKLGHYRDLKLENVLLDSEGHIVLTD 

FGLSKEFLTEEKERTFSFCGTIEYMAPEIIRSKTGH 

GKAVDWWSLGILLFELLTGASPFTLEGERNTQAE 

VSRRILKCSPPFPPRIGPVAQDLLQRLLCKDPKKR 

LGAGPQGAQEVRNHPFFQGLDWVALAARKIPAP 

FRPQIRSELDVGXNFAEEFTRLEPVYSPPGQVPPPG 

DPRIFQGYSFVAPSILFDHNNAVMTDGLEAPGAG 

DRPGRAAVARSAMMQDSPFFQQYELDLREPALG 

QGSFSVCRRCRQRQSGQEFAVKILSRRLEANTQR 

EVAALRLCQSHPNVVNLHEVHHDQLHTYLVLEL 

LRGGELLEHIRKKRHFSESEASQILRSLVSAVSFM 

HEEAGWHRDLKPENILYADDTPGAPVKUDFG/F 

ESCDLWSLGVILY\MMLSGQAPFQGASGQGGQS 

QAAEIMCKIREGRFSLDGEAWQGVSEEAKELVR 

GLLTVDPAKRLKLEGLRGSSWLQDGSARSSPPLR 

TPDVLESSGPAVRSGLNATFMAFNRGKREGFFLK 

SVENAPLAKRRKQKLRSATASRRGSPAPANPGR 

APVASKGAPRRANGPLPPS 


3450 


A 


201 


1705 


KGTEMNKSRWQSRRRHGRRSHQQNPWFRLRDS 

EDRSDSRAAQPAHDSGHGDDESPSTSSGTAGTSS 

VPELPGFYFDPEKKRYFRLLPGHNNCNPLTKESIR 

QKEMDESKRLRLLQEEDRRKKIARMGFNASSMLR 

KSQLGFLNVTNYCHLAHELRLSCMERKKVQIRS 

MDPSALASDRFNLILADTNSDRLFTVNDVTVGGS 

KYGIINLQSLKTPTLKVFMHENLYFTNRKV\NSV 

CWASLNHLDSHILLCLMGLAETPGCATLLPASLF 

VNSHPAGIDRPG\MLCSFRIPGAWSCAWSLNIQA 

NNCFSTGLSRRVLLTNVVTGHRQSFGTNSDVLA 

QQFALMAPLLFNGCRSGEIFAIDLRCGNQGKGW 

KATRLFHDSAVTSVRILQDEQYLMASDMAGKIK 

LWDLRTTKCVRQYEGHVNEYAYLPLHVHEEEGI 

LVAVGQDCYTRIWSLHDARLLRTIPSPYPASKAD 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A"Alanlne C'=C)'steine, I>°Aspartic Acid, 
E=Glutaraic Acid, F^Phenylalanine, G^GIycine, H^Histidine, 
I»l5oleucine, K^Lysine, L^Leucine, IVf=>Methionine, 
N-Asparagine, P=Proline, Q=Glutainine, K'^^Arginine, S=^erlne, 
T«Tbreonine, V=VaHne, W=Tryptophan, Y=Tyrosinc, 
X^nknown, *»$top codon, A-possible nucleotide delcHon, 
\Fpossible nucleotide insertion 










IPSVAFSSRLGGSRGAPGLLMAVGQDLYCYSYS 


3451 


A 


19 


6033 


LLSAMLSHGAGLALWITLSLLQTGLAEPERCNFT 

LAESKASSHSVSIQWRILGSPCNFSLIYSSDTLGA 

ALCPTFRIDNTTYGCNLQDLQAGTIYNFKIISLDE 

ERTVVLQTDPLPPARPGVSKEKTTSTGLHVWWT 

PSSGKVTSYEVQLFDENNQKIQGVQIQESTSWNE 

YTFFNLTAGSKYNIAITAVSGGKRSFSVYTNGST 

VPSPVKDIGISTKANSLLISWSHGSGNVERYRLM 

LMDKGILVHGGVVDKHATSYAFHGLSPGYLYNL 

TVMTEAAGLQNYRWKLVRTAPMEVSNLKVTND 

GSLTSLKVKWQRPPGXNVDSYNITLSHKGTIKESR 

VLAPWIT\ETHFKELVPGRLY\QVTCSAVSLGELS 

AQKM\AVGRTFPDKVANLEANNNGRMRSLVVS 

WSPPAGDWEQYRILLFNDSWLLNTTVGKEETQ 

YVMDGTGLVPGRQYEVEVrVESGNLKNSERCQG 

RTVPLAVLQLRVKHANETSLSIMWQTPVAEWEK 

YIISLADRDLLLIHKSLSKDAKEFTFTDLVPGRKY 

MATVTSISGDLKNSSSVKGRTVPAQVTDLHVAN 

QGMTSSLFTNWTQAQGDVEFYQVLLIHENVVIK 

NESISSETSRYSFHSLKSGSLYSVWTTVSGGISSR 

QVVVEGRTVPSSVSGVTVNNSGRNDYLSVSWLL 

APGDVDNYEVTLSHDGKVVQSLVIAKSVRECSF 

SSLTPGRLYTVTrriRSGKYENHSFSQERTVPDKV 

QGVSVSNSARSDYLRVSWVHATGDFDHYEVTIK 

NKNNFIQTKSIPKSENECVFVQLVPGRLYSVTVT 

TKSGQYEANEQGNGRTIPEPVKDLTLRNRSTEDL 

HVTWSGANGDVDQYEIQLLFNDMKVFPPFHLVN 

TATEYRFTSLTPGRQYKILVLTISGDVQQSAFIEG 

FTVPSAVK^IfflSPNGATDSLTVNWTPGGGDVDS 

YTVSAFRHSQKVDSQTIPKHVFEHTFHRLEAGEQ 

YQIMIASVSGSLKNQINVVGRTVPASVQGVIADN 

AYSSYSLIVSWQKAAGVAERYDILLLTENGILLR 

NTSEPATTKQJIKFEDLTPGKKYKIQILTVSGGLFS 

KEAQTEGRTVPAAVTDLRITENSTRHLSFRWTAS 

EGELSWYNIFLYNPDGNLQERAQVDPLVQSFSFQ 

NLLQGRMYKMVIVTHSGELSNESFIFGRTVPASV 

SHLRGSNRNTTDSLWFNWSPASGDFDFYELILYN 

PNGTKKENWKDKDLTEWRFQGLVPGRKYVLW 

VVTHSGDLSNKVTAESRTAPSPPSLMSFADIANT 

SLAITWKGPPDWTDYNDFELQWLPRDALTVFNP 

YNNRKSEGRIVYGLRPGRSYQFNVKTVSGDSWK 

TYSKPIFGSVRTBCPDKIQNLHCRPQNSTAIACSWl 

PPDSDFDGYSIECRKMDTQEVEFSRKLEKEKSLL 

NIMMLVPHKRYLVSIKVQSAGMTSEV VEDSTIT 

MmRPPPPPPHIRVNEKDVLISKSSINFTVNCSWFS 

DTNGAVKYFTWVREADGSDELKPEQQHPLPSY 

LEYRHNASIRVYQTNYFASKCAENPNSNSkSFNI 

KLGAEMESLGGKCDPTQQKFCDGPLKPHTAYRI 

LFGAIEGVSAGLFLIGMLVAVVALLICRQKVSHG 

RERPSARLSIRRDRPLSVHLNLGQKGNRKTSCPIK 

INQFEGHFMKLQADSNYLLSKEYEELKDVGRNQ 

SCDIALLPENRGKNRYNNILPYDATRVKLSNVDD 

DPCSDYINASYIPGMNFRREYIVTQGPLPGTKDDF 

WKMVWEQNVHNIVMVTQCVEKGRVKCDHYW 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residacof 
peptide 
sequence 


Amino acid sequence (A«Alanine C'^Cysteine, I>»Aspartic Acid, 
E^Glutamic Acid» F~Phenylalanine) G=Clycine, H=Histidine, 
I— Isoieucine, K=Lysine, L^Leucine, M«Metliionine, 
N»Asparagine, P=Proline, Q=GIutamiae, R-Afginine, S»Serine, 
T"Threonlne, V^Valine, W»Tryptophan, Y-Tyrosine, 
X=Unluiown, *»Stop codon, /s'possible nucleotide deletion, 
V^possible nucleodde insertion 










PADQDSLYYGDLILQMLSESVLPEWTIREFKICGE 

EQLDAHRLIRHFHYTVWPDHGVPETTQSLIQFVR 

TVRDYINRSPGAGPTVVHCSAGVGRTGTFIALDR 

ILQQLDSKDSVDIYGAV\HDLRLHRVHMVQTEC 

QYVYLHQCVRDVLRARKLRSEQENPLFPIYENV 

NPEYHRDPVYSRH 


3452 


A 


63 


1073 


FFRSSSDNGSPIRQYE/HSTPAHQGPVMGLEGKS/ 

ARNSQLRIVLVGKTGAGKSATGNSILGRKVFHSG 

TAAKSITKKCEKRSSSWKETELVVVDTPGIFDTE 

VrrMAc loisJbllKUlL«L 1 SrOriiALLLV VFLOKY IbJa 

EHKATEKILKMFGERARSFMILIFIRKDDLGDTN 

LHDYLREAPEDIQDLMDBFGDRYCALNNKATGA 

EQEAQRAQLLGLIQRVVRENKEGCYTNRMYQR 

AEEEIQKQTQAMQELHRVELEREKARIREEYEEK 

IRKLEDKVEQEKRKKQMEKKLAEQEAHYAVRQ 

QRARTEVESKDGELELIMTALQIASFILLRLFAED 


3453 

• 


A 


2674 


514 


GPimiaKKAKMKDMPLRIH\a.LGLAITTLVQAV 

DKKVDCPRLCTCEIRPWFTPRSIYMEASTVDCND 

LGLLTFPARLPANTQILLLQTNNIAKffiYSTDFPV 

NLTGLDLSQNNLSSVTOTNGKKMPQLLSVYLEEN 

KLTELPEKCLSELSNLQELYINHNLLSTISPGAFIG 

LHNLLRLHLNSNRLQMINSKWFDALPNLEILMIG 

ENPIIRIKDMNFKPLINLRSLVIAGINLTEIPDNAL 

VGLENLESISFYDNRLIKVPHVALQKWNLKFLD 

LNKNPINRIRRGDFSNMLHLKELGINNMPELISID 

SLAVDNLPDLRKIEATNNPRLSYIHPNAFFRLPKL 

ESLMLNSNALSALYHGTIESLPNLKEISIHSNPIRC 

DCVIRWMNMNKTNIRFMEPDSLFCVDPPEFQGQ 

NVRQVHFRDMMEICLPLIAPESFPSNLNVEAGSY 

VSFHCRATAXEPQPEIYWITPSGQKLLPNmiDKF 

YVHoiiu iLrUlNOV IFJvbCjCjLY iClAiNLYCjAJJLJs. 

SVMIKVDGSFPQDNNGSLNIKIRDIQANSVLVSW 

KASSIGLKSSVKWTAFVKTENSHAAQSARIPSDV 

KVYNLTHLNPSTEYKICmiPTIYQKimKKCVNVT 

TKGLHPDQKEYEKNNTTTLMACLGGLLGnGVIC 

LISCLSPEMNCDGGHSYVRNYLQKPTFALGELYP 

PLINLWEAGKEKSTSLKVKATVIGLPTNMS 


3454 


A 


1844 


244 


ERYLFATYVAPSATLDIGLQQEKKKEIYMKIQPP 

FEDLFDTAEEYILLLLLEPWTKMVKSDQIAYKKV 

ELVEETRQLDSTYFRKLQALHKETFSKBCAEDTTC 

EIGTGILSLSNVSKRTEYWDNVPAEYKHFKFSDL 

LNNKLEFEHFRQFLETHSSSMDLMCWTDIEQFRR 

ITYRDRNQRKAKSIYIKNKYLNKKYFFGFNSPAS 

LYQQNQVMHLSGGWGKILHEQLDAPVLVEIQK 

HVQNRLENVWLPLFLASEQFAARQKIKVQMKDI 

AEELLLQKAEKKIGVWKPVESKWISSSCKIIAFRK 

ATI "KTDWTCD rWCYD IJ\/ A 1 VfiTM T CKT/^T T T7"tt7nT?\/A 

ALiLiNIr V 1 oKl^r vKr VALrKvjJJLL>CrIMuJ^J^r W V^JS V Vi/ 

KYKDLCHSHCDESVIQKKITTIINCFINSSIPPALQI 
DTPVFOAOKTIFHRKFT GPWFRFAOVfTFLGVMF 

KFWPQFCEFRKNLTDENIMSVLERRQEYNKQKK 
KLAVL/QNDEKSGKDGKQYANTSVPAIKTALLS 
DSFLGLQPYGRQPTWCYSKYIEALEQERJLLKIQE 
ELEK\SCLQACNLSQILRLALQLCL 


3455 


A 


228 


3330 


APTAQAMMSFGGADALLGAPFAPLHGGGSLHY 
ALARXGGAGGTRSAAGSSSGFHSWTRTSVSSVS 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue of 
peptide 
sequence 


Amino acid sequence (A=^AIanine C=Cysteine, I>=A5partic Acid, 
£=Glutamic Acid, F=Phenylalanine, G'^GIycine, H»Histidine, 
I==Isoleucine, K=Lysine, i^Leucine, M=IVfethionine, 
N^Asparagine, P^Proline, Q^GIutamine, R=Arginine, S=Serine, 
T»Threonine, V=VaIlne, W=Tryptophan, Y=Tyrosine, 
X-Unknown, ^^^top codon, /^possible nucleotide deletion, ' 
\=possible nucleotide insertion 










ASPSRPRGAGAASSTDSLDTLSNGPEGCMVAVA 

TSRSEKEQLQALNDRFAGYIDKVRQLEAHNRSLE 

GEAAALRQQQAGRSAMGELYEREVREMRGAVL 

RLGAARGQLRLEQEHLLEDIAHVRQRLDDEARQ 

REEAEAAARALARFAQEAEAARVDLQKKAQAL 

QEECGYLRRHHQEEVGELLGQIQGSGAAQAQM 

QAETRDALKCDVTSALREIRAQLEGHAVQSTLQ 

SEEWFRVRLDRLSEAAKVNTDAMRSAQEEITEY 

RRQLQARllKLEALKSTKDSLERQRSELEDRHQA 

DIASYQEAIQQLDAELRNTKWEMAAQLREYQDL 

LNVKMALDIEIAAYRKLLEGEECRIGFGPIPFSLP 

EGLPKDPSVSTHIKVKSEEKIKVVEKSEKETVIVEE 

QTEETQVTEEVTEEEDKEAKEEEGKEEEGGEEEE 

AEGGEEETKSPPAEEAASPEKEAKSPVKEEAKSP 

AEAKSPEKEEAKSPAEVKSPEKAKSPAKEEAKSP 

PE\AKSPEKDGKQNFQAEVKSPEKAKSPAKEEAK 

SPAEAKSPEKAKSPVKEEAKSPAEAKSPVKEEAK 

SPAEVKSPEKAKSPTKEEUKSPEKAKSPEKAKSP 

EKEEAKSPEKAKSPVKAEAKSPEKAKSPVKAEA 

KSPEBCAKSPVKEEAKSPEKAKSPVKEEAKSPEKA 

KSPVKEEAKTPEKAKSPVKEEAKSPEKAKSPEKA 

KTLDVKSPEAKTPAKEEARSPADKFPEKAKSPVK 

EEVKSPEKAKSPLKEDAKAPEKEIPKKEEVKSPV 

KEEEKPQEVKVKEPPKKAEEEKAPATPKTEEKK 

DSKKEEAPKKEAPKPKVEEKKEPAVEKPKESKV 

EAKKEEAEDKKKVPTPEKEAPAKVEVKEDAKPK 

EKTEVAKKEPDDAKAKEPSKPAEKKEAAPEKKD 

TKEEKAKKPEEKPKTEAKAKEDDKTLSKEPSKP 

KAEKAEKSSSTDQKDSKPPEKATEDKAAKGK 


3456 


A 


258 


1463 


YLSFIPGHASKSAPMNGHCFAENGPSQKSSLPPLL 

IPPSENLGPHEEDQVVCGFKKLTVNGVCASTPPL 

TPIKNSPSLFPCAPLCERGSRPLPPLPISEALSLDDT 

DCEVEFLTSSDTDFLLEDSTLSDFKYDVPGVRRSF 

RGCGQINYAYFDTPAVSAADLSYVSDQNG\GVP 

DPNPrrPQ 1 HRRLRRSHSGPAGSrNKPAlKlSNCCl 

HRASPNSDEDKPEVPPRVPIPPRPVKPDYRRWSA 

EVTSSTYSDEDRPPKVPPREPLSPSNSRTPSPKSLP 

SYLNGVMPPTQSFAPDPKYVSSKALQRQNSEGS 

ASKVPCELPIIENGKKVSSTHYYLLPERPPYLDKY 

EKFFREAKKKNGGAQIQPLPADCGISSATEKPDS 

KTKMDLGGHVKRKHLSYVGTP 


3457 


A 


2 


4869 


FILSSSSSASSEHFHHHYSFGNWWPGSFKGHRMS 

LPFYQRCHQHYDLSYRNKDVRSTVSHYQREKKR 

SAVYTQGSTAYSSRSSAAHRRESEAFRRASASSS 

QQQASQHALSSEVSRKAASAYDYGSSHGLTDSS 

LLLDDYSSKLSPKPKRAKHSLLSGEEKENLPSDY 

MVPIFSGRQKHVSGITDTEEERIKEAAAYIAQRNL 

LASEEGITTPKQSTASKQTTASKQSTASKQSTASK 

RKVVIRGKAERLSLRKTLEETETYHAKLKEDHLL 

HAPEFIIKPRSHTVWEKENVKLHCSIAGWPEPRV 

TWYKNQVPINVHANPGKYIIESRYGMHTLEINAC 

DFEDTAQYRASAMNVKGELSAYASVVVKRYKG 

EFDETRFHAGASTMPLSFGVTPYGYASRFEfflFD 

DKFDVSFGREGETMSLGCRVVITPEIKHFQPEIQ 
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SEQiD 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=AIanine C^CysteinCi D^Aspartic Acid, 1 
C=Glutaniic Acid, F^Phenylalanine, G»Gtycine, H^HisHdlne, 
I=Isoleucine, IC=Lysine, L^l^eucine, M=Methionine, 
N^Asparaginc* P«»ProIine, Q=GIutamine, R=Arginine, S=Scrine, 
T«Threotiine, V«Valinei W«Tryptoplian, Y=Tyrosine, 
X=UnknowD> *«Stop codon, /"possible nucleotide dcletioii» 
V°possible nucleotide insertion 










WYRNGVPLSPSKWVQTLWSGERATLTFSHLNKE 

DEGLYTIRVRMGEYYEQYSAYVFVRDADAEIEG 

APAAPLDVKCLEANKDYinSWKQPAVDGGSPIL 

GYFEDKCEVGTDSWSQCNDTPVKFARFPVTGLIE 

GRSYEFRVRAVNKMGIGFPSRVSEPVAALDPAEK 

ARLKS/PPLSTLDWTyVIVTEEEPSEGIVPGPPTDLS 

VTEATRSYWLSWKPPGQRGHEGIMYFVEKCEA 

GTENWQRVNTELPVKSPRFALFDLAEGKSYCFR 

VRCSNSAGVGEPSEATEVTVVGDKLDIPKAPGKJ 

IPSKNTDTSVVVSWEESKDAKELVGYYIEANVA 

GSGKWEPCNNNPVKTHRFTCHGLVTGQSYIFRV 

RAVNAAGLSEYSQDSEAIEVKAAIAPPSPPCDITC 

LESFRDSMVLGWKQPDKIGGAEITGYYVNYREV 

roOVPGKWREANVKAVSEEAYKISNLKENMVY 

QFQVAAMNMAGLGAPSAVSECFXCEEWTIAVP 

GPPHSLKCSEVRKDSLVLQWKPPVHSGRTPVTG 

YFVDLKEAKAKEDQWRGLNEAAIKNVYLKVRG 

LKEGVSYVFRVRAINQAGVGKPSDLAGPVVAET 

RPGTKEVWNVDDDGVISLNFECDKMTPKSEFS 

WSKDYVSTEDSPRLEVESKGNKTKMTFKDLGM 

DDLGIYSCDVTDTDGIASSYLIDEEELKRLLALSH 

EHKFPTVPVKSELAVEILEKGQVRF\WMQAEKLS 

GNAKAHSTYlFNEKGIFEGPKYKNfHIDRNTGIIEMF 

MEKLQDEDEGTYTFQLQDGBCATNHSTWLVGD 

VFKKLQKEAEFQRQEWIRKQGPHFVEYLSWEVT 

GEChTVLLKCKVANIKKETHlVWYKDEREISVDE 

KHDFKDGICTLLITEFSKKDAGIYEVILKDDRGK 

DKSRLKLVDEAFKELMMEVCKKIALSATDLKTQ 

o lAiiLjll^L Ybr V 1 1 Y VJc.jJJ-rIvViN WolilNLroAlKYoU 

RVKTGVTGEQIWLQINEPTPNDKGKYVMELFDG 

KTGHQKTVDLSGQAYDEAYAEFQRLKQAAIAEK 

NRARVLGGLPDWTIQEGKALNLTCNVWGDPPP 

EVSWLKNEKALASDDHCNLKFEAGRTAYFTING 

VSTADSGKYGLWKNKYGSETSDFTVSVFIPEEE 

ARMAALESLKGGKKAK 


3458 


A 


3963 


827 


LSRSS SDNNTNTLGRNVMSTATSPLMG AQSFPNL 

TTPGTTSTVTMSTSSVTSSSNVATATTVLSYGQS 

LSNTLTTSLTSTSSESDTGQEAEYSLYDFLDSCRA 

STLLAELDDDEDLPEPDEEDDENEDDNQEDQEY 

EEVMILRRPSLQRRAGSRSDVTHHAVTSQLPQVP 

AGAGSRJPIGEQEEEEYETKGGRRRTWDDDYVLK 

RQFSALVPAFDPRPGRTNVQQTTDLEIPPPGTPHS 

ELLEEVECTPSPRLALTLKVTGLGTTREVELPLTN 

FRSHFYYVQKLLQLSCNGNVKSDKLRRIWEPTY 

TIMYREMKDSDKEKENGKMGCWSBHVEQYLG 

TDELPKNDLITYLQKNADAAFLRHWKLTGTNBCS 

IRKNRNCSQUAAYWDLG\EHGTK\SQLNQGAIST 

LQSSDILKLTKEQPQAKAGNGQNSCGVEDVLQL 

LRILYIVASDPYSRISOEDGDEOPOFTFPPDEFTS/ 

mTTKILQQIEEPLALASGALPDWCEQLTSKCPF 

LIPFETRQLYFTCTAFGASRAIVWLQNRREATVE 

RTRTTSSVRRDDPGEFRVGRLKHERVKVPRGESL 

MEWAENVMQIHADRKSVLEVEFLGEEGTGLGPT 

LEFYALVAAEFQRTDLGAWLCDDNFPDDESRHV 

DLGGGLKPPGYYVQRSCGLFTAPFPQDSDELERI 
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SEQJB 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A-Alaaine OCysteine, I>=Aspar(ic Acid, 
E^CIutamic Acid, F=Phenylalanine, G=Glycine, H^Histidlne, 
Is'Isoleucine, K«=Ly$ine» L^Leucine, M^Methionine, 
N=A5paragine, P»Proline, Q^GIutamine, R»Arginine, S=>Serine, 
T»Thrconine, V=Valine, W=Tryptophan, Y^Tyroslne, 
X»lJnknowii, *=Stop codon, /=possible nucleotide deletion, 
\-possible nucleotide insertion 










TKLFHFLGIFLAKCIQDNRLVDLPISKPFFKLMCM 

GDIKSNMSKLIYESRGDRDLHCTESQSEASTEEG 

HDSLSVGSFEEDSKSEFILDPPKPKPPAWFNGILT 

WEDFELVNPHRARFLKEIKDLAIKRRQILST^KGL 

SEDEKNTKLOELVLKNPSGSGPPLSIEDLGLMFOF 

CPSSRIYGFTAVDLKPSGEDEMITMDNAEEYVDL 

MFDFCMHTGiQKQMEAFRDGFNKVFPMEKLSSF 

SHEEVQMILCGNQSPSWAAEDIINYTEPKLGYTR 

DSPGFLRFVRVLCGMSSDERKAFLQFTTGCSTLP 

PGGLANLHPRLTVVRKVDATDASYPSVNTCVHY 

LKLPEYSSEEIMRERLLAATMEKGFHLN 


3459 


A 


88 


603 


SCGPRGLASLGLGFSGRCDDQNKGRS\DGPEAQA 

EACSGERTYQELLVNQNPIAQPLASRRLTRKLYK 

CIKKAVKQKQmRGVKEVQKFVNKGEKGIMVLA 

GDTLPIEVYCHLPVMCEDRNLPYVYIPSKTDLGA 

AAGSKRPTCVIMVKPHEEYQEAYDECLEEVQSL 

PLPL 


3460 


A 


139 


1997 


qvtnmsdkselkaelerkkqrlaqireekkrke 

eerkkketdqkkeavapvqeesdlekkrreaea 

llqsmgltpespivpppmspssksvstpseagsqd 

sgdgavgsrrgpiklgmakitqvdfppreivtyt 

ketqtpvmaqpkedeeedddwapkppiepeeek: 

tlkkdeen\dskapphelteeekqqilhseeflsff 

DHSTRIVERALSEQINIFFDYSGRDF/ENDKEGEIQ 

agabclslnrqff\der\wskasgwvscldwssq 

yp\ellvasynnnedaphepdgvalvwnmk:yk 

kttpeyvfhcqsavmsatfakfhpnlvvggtys 

gqivlwdnrsnkrtpvqrtplsaaahthpvycv 

nwgtqnahnlisistdgkicswsldmlshpqds 

gsvytacrhgskagisemfeghqgpitgihchaa 

vgavdfshlyvtssfdwtvklwttkknkplysf 

ednagyvydvmwspthpalfacvdgmgrldl 

wnlnndtevptasisvegnpalnrvrwthsgre 

lavgdsegqiviydvgeqiavprndewarfgrtl 

aeinanradaeeeaatripa 


3461 


A 


139 


1997 


qvtnmsdkselkaelerkkqrlaqireekjgrice 

eerkkketdqkkeavapvqeesdlekkrreaea 

llqsmgltpespivpppmspssksvstpseagsqd 

sgdgavgsrrgpiklgmakitqvdfppreivtyt 

ketqtpvmaqpkedeeedddvvapkppiepeeeic 

tlkkdeen\dskapphelteeekqqe.hseeflsff 

dhstriveralseqinffdysgrdf/endkegeiq 

agaklslnrqff\der\wskasgwvscldwssq 

yp\ellvasynnnedaphepdgvalvwnmk:yk 

kttpeyvfhcqsavmsatfabcfhpnlvvggtys 

gqivlwdnrsnkrtpvqrtplsaaahthpvycv 

nwgtqnahnlisistdgkicswsldmlshpqds 

MELVHKQSKAVAVTSMSFPVGDVNNFVVGSEE 

GSVYTACRHGSKAGISEMFEGHQGPITGIHCHAA 

VGAVDFSHLYVTSSFDWTVKLWTTK2WKPLYSF 

EDNAGYVYDVMWSPTHPALFACVDGMGRLDL 

WNLNNDTEVPTASISVEGNPALNRVRWTHSGRE 

IAVGDSEGQIVIYDVGEQIAVPRNDEWARFGRTL 

AEINANRADAEEEAATRIPA 
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I SEQID 
NO: 


Method 


Predicted 
beginning 
nucleotide 

lOCDtlOO 

corresponding 
to first amino 
add residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C'K^ysteine, D^Aspartic Acid, 
^Glutamic Acid, F<»PhenyIalanine, G°Glycine, H^Histidine, 
I=Isoleucine, K»Lysine, L^Leucine, M«Metfaionine, 
N=Asparagine, P=Proline, 0=Glutamine, R»Aigialne; S^^rine, 
T=>Threonine, V=Valine, W-Tryptopban, Y=Tyrosine, 
X^Unlcnown, *^top codon, A>possible nucleotide deletion, 
\»pos8ible nucleotide insertion 


3462 


A 


2 


2643 


TAPEFSRSTHASAHASVARVLRNREIAQLKKEQR 

RQEFQIRALESQKRQQEMVLRRKTQEVSALRRL 

AKPMSERVAGRAGLKPPMLDSGAEVSASTTSSE 

AESGARSVSSrVRQWNRKINHFLGDHPAPTVKGT 

RPARKKFQKKGASQSFSBCAARLKWQSLERRUDI 

VMQRMTIVIUEADMERLIKKREELFLLQEALRR 

KRERLQAESPEEEKGLQELAEEIEVLAANIDYIND 

GITDCQATIVQLEETKEELDSTDTSVVISSCSLAE 

ARLLLDNFLKASIDKGLQVAQKEAQIRLLEGRLR 

QTDMAGSSQNHLLLDALREKAEAHPELQALIYN 

VQQENGYASTDEEISEFSEGSFSQSFTMKGSTSH 

DDFKFKSEPKLSAQMKAVSAECLGPPLDISTKNI 

TKSLASLVEIKEDGVGFSVRDPYYRDRVSRTVSL 

PTRGSTFPRQSRATETSPLTRRKSYDRGQPIRSTD 

VGFTPPSSPPTRPRNDRNVFSRLTSNQSQGSALD 

KSDDSDSSLVSEVLRGnSPVGGAKGARTAPLQCV 

SMAEGHTKPILCLDATDELLFTGSKDRSCKMWN 

LVTGQEIAALKGHPNNWSnCYCSHSGLVFSVST 

SYIKVWDIRDSAKCIRTLTSSGQVISGDACAATST 

WELSRFQPVGKLTGHIGPVMCLTVTQTASQHDL 

VVTGSKDHYVKMFELGECVTGTIGPTHNFEPPH 

YDGIECLAIQGDILFSGSRDNGIKKWDLDQQELIQ 

QIPNAHKDWVCALAFIPGRPMLLSACRAGVIKV 

WNNODNFTPIGEIKGHDSPINAICTNAKHIFTASSG 

CRVKVWNYVPGLTPCLPRRVLAIKGRATTLP 


3463 


A 


198 


3146 


SGEPRPEPGNMATCIGEKIEDFKVGNLLGKGSFA 

GVYRAESfflTGLEVAIKMIDKKAMYKAGMVQR 

VQNEVKIHCQLKHPSILELYNYFEDSNYVYLVLE 

MCHNGEMNRYLKNRVKPFSENEARHFMHQIITG 

MLYLHSHGILHRDLTLSNLLLTRNMNIKIADFGL 

ATQLKMPHEKHYTLCGTPNYISPEIATRSAHGLE 

SDVWSLGCMFYTLLIGRPPFDTDTVKNTLMCVV 

LADYEMPTFLSIEAKDLIHQLLRRNPADRLSLSSV 

LDHPFMSRNSSTKSKDLGTVEDSIDSGHATISTAI 

TASSSTSISGSLFDl<JWU.LIGQPLPNmTVFPKNK 

SSTDFSSSGDGNSFYTQWGNQETSNSGRGRVIQD 

AEERPHSRYLRRAYSSDRSGTSNSQSQAKTYTM 

ERCHSAEMLSVSKRSGGGENEERYSPTDNNANIF 

NFFKEKTSSSSGSFERPDNNQALSNHLCPGKTPFP 

FADPTPQTETVQQWFGNLQINAHLRK'1'J'HYDSIS 

PNRDFQGHPDLQKDTSKNAWTDTKVKKNSDAS 

DNAHSVKQQNTMKYMTALHSKPEnQQECVFGS 

DPLSEQSKTRGMEPPWGYQNRTLRSITSPLVAHR 

LKPIRQKTKKAVVSILDSEEVCVELVKEYASQEY 

VKEVLQISSDGNTITrYYPNGG\RGFPLA\DRPPSP 

T\DNISR\YSF\DNLPEKYWRKYQYASRFVQLVRS 

KSPKITYFTRYAKCILMENSPGADFEVWFYDGV 

KIHKTEDFIOVIEKTGKSYTLKSESEVNSLKEEIK 

MYMDHANEGHRJCLALESnSEEERKTRSAPFFPn 

IGRKPGSTSSPKALSPPPSVDSNYPTRDRASFNRM 

VMHSAASPTQAPILNPSMVTNEGLGLTTTASGTD 

ISSNSLKDCLPKSAQLLKSVFVKNVGWATQNLTS 

GAVWVQFNDGSQLVVQAGVSSISYTSPNGQ\TTR 

WGENEKLPDYIKQKLQCLSSILLMFSNPTPNrFH 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue of 
peptide 
sequence 


Amino acid seauence (A^^Alanine C^Cvsteinei D^Asoartic Acid. 
£=GlQtaniic Add, F=Phenyl alanine, G^GIycine, H^Histidine, 
I-Isoleudne, K-Lysine, L=Leudne, M=IVIethionine, 
N=Asparagine, P^'Proliney Q^Glutamine, R^Arginine, S=Serine, 
T«Threonine, V=Vallne, W=Tryptophan, V=Tyrosine, 
XsUnknown, *»Stop codon, ^possible nucleotide deletion, 
V^ssible nucleotide insertioa 


3464 


A 


14 


348 


AVRTVSGTSLGPRSHSRSPGRCHCFSAVTFSSPRL 
AASEAPDPMEEWDVPQMKKEVESLKYQLAFQR 
EMASKTIPELLKWIEDGIPKDPFLNPDLMKNNPW 
V\EKGKCTIL 


3465 


A 


5537 

4 


405 


VRKLDRERVGAWWRGAWARHPRQEAGEHAKR 

RKGHAETPRGRRKGRAGRSAAAVGELRPARRSL 

ETSRAAAAMAKDSPSPLGASPKKPGCSSPAAAV 

LENQRRELEKLRAELEAERAGWRAERRRFAARE 

RQLREEAERERRQLADRLRSKWEAQRSRELRQL 

QEEMQREREAEIRQLLRWKEAEQRQLQQLLHRE 

RDGWRQARELQRQLAEELVNRGHCSRPGASEV 

SAAQCRCRLQEVLAQLRWQTDGEQAARIRYLQ 

AALEVERQLFLKYILAHFRGHPALSGSPDPQAVH 

SLEEPLPQTSSGSCHAPKPACQLGSLDSLSAEVG 

VRSRSLGLVSSACSSSPDGLLSTHASSLDCFAPAC 

SRSLDSTRSLPKASKSEERPSSPDTSTPGSRRLSPP 

PSPLPPPPPPSAHRKLSNPRGGEGSESQPCEVLTPS 

PPGLGHHELIKLNWLLAKALWVLARRCYTLQEE 

NKQLRRAGCPYQADEKVKRLKVKRAELTGLAR 

RLADRARELQETNLRAVSAPIPGESCAGLELCQV 

FARQRARDLSEQASAPLAKDKQIEELRQECHLLQ 

ARVASGPCSDLHTGRGGPCTQWLNVRDLDRLQ 

RESQREVLRLQRQLMLQQGNGGAWPEAGGQSA 

TCEEVRRQMLALERELDQRRRECQELGAQAAPA 

RRRGEEAETQLQAALLKNAWLAEENGRLQAKT 

DWVRKVEAENSEVRGHLGRACQERDASGLIAEQ 

LLQQAARGQDRQQQLQRDPQKALCDLHPSWKEI 

QALQCRPGHPPEQPWETSQMPESQVKGSRRPKF 

HARAEDYAVSQPNRDIQEKREASLEESPVALGES 

ASVPQVSETVPASQPLSKKTSSQSNSSSEGSMWA 

TVPSSPTLDRDTASEVDDLEPDSVSLALEMGGSA 

APAAPKLKIFMAQYNYNPFEGPNDHPEGELPLTA 

GDYIYIFGDMDEDGFYEGELEDGRRGLVPSNFVE 

QIPDSYIPGCLPAKSPDLGPSQLPAGQDEALEEDS 

LLSGKAQGVVDRGLCQMVRVGSKTEVATEILDT 

KTEACQLGLLQSMGKQGLSRPLLGTKGVLRMAP 

MQLHLQNVTATSANITWVYSSHRHPHVVYLDD 

REHALTPAGVSCYTFQGLCPGTHYRARVEVRLP 

RDLLQVYWGTMSSTVTFDTLLAGPPYPPLDVLV 

ERHASPGVLVVSWLPVTIDSAGSSNGVQVTGYA 

VYADGLKVCEVADATAGSTLLEFSQLQVPLTWQ 

KVSVRTMSLCGESLDSVPAQIPEDFFMCHRWPET 

PPFSYTCGDPSTYRVTFPVCPQKLSLAPPSAKASP 

HNPGSCGEPQAKFLEAFFEEPPRRQSPVSNLGSE 

GECPSSGAGSQAQELAEAWEGCRKDLLFQKSPQ 

NHRPPSVSDQTGEKENCVQHMGTSKSPAPGFIHL 

RTECGPRKEPCQEKAALERVLRQKQDAQGFTPP 

QLGASQQYASDFHNVLKEEQEALCLDLWGTERR 

PSAKVnCMPRGGPQQLGTGAKTPARVFVALSDY 

NPLVMSANLKAAEEELVFQKRQLLRVWGSQDT 

HDFYLSECNRQVGNIPGRLVAEMEVGTEQTDRR 

WRSPAQGHLPSVAHLEDFQGLTIPQGSSLVLQGN 

SKRLPLWTPKIMIAALDYDPGDGQMGGQGKGRL 

ALRAGDVVMVY\GPMDDQGFYYGELGGHRG\L 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'^AIanine C^Cysteine, D^-Aspartic Acid, 
£=GIutamic Acid, F^Phenylalanine, G=Glycine, H^Histidlne, 
I— Isoleucinei K=Lysine, li=Leucine» M=Metbionine, 
N=Asparagine» P=ProIine, QsGlutamine, R»Arginine, &=Seriae, 
T"Tbreonine, V^Vailne, W=Tryptophan, y«Tyrosine, 
X^Unltnown, *=Stop codon, possible nucleotide deletion) 
\ppos5ible nucleotide insertion 










VPANLRIKMSSQGH 


3466 


A 


1 


1111 


MSKPPDLLLRLLRGAPRQRVCTLFIIGFKFTFFVSI 

MIYWHWGEPKEKGQLYNLPAEIPCPTLTPPTPP 

SHGPTPGNIFFLETSDRTNPNFLFMCSVESAARTH 

PESHVLVLMKGLPGGNASLPRHLGISLLSCFPNV 

OMLPLDLRELFRDTFLADWYAAVOGRWEPYLL 

PVLSDASRIALMWKFGGIYLDTDFIVLKNLRNLT 

NVLGTQSRYVLNGAFLAFERRHEFMALCMRDFV 

DHYNGWIWGHQGPQLLTRVFKKWCSIRSLAESR 

ACRGVTTLPPEAFYFIPWQDWKKYFEDINPEELP 

RLLSATYAVHVWNKKSQGTRFEATSRALLAQLH 

ARYCPTTHE/DHENVLVKGPAGHLPNLLLMGHW 


3467 


A 


1 


2175 


MAKVILKQSKQCKNLLTCKVAQVCPVCGCLHC 

YFWWLSGLESRRPSSPLIDIKPIEFGVLSAKKEPIQ 

PSVLRRTYNPDDYFRKFEPHLYSLDSNSDDVDSL 

TDEEILSKYQLGMLHFSTQYDLLHNHLTVRVIEA ■ 

RDLPPPISHDGSRQDMAHSNPYVKICLLPDQKNS 

KQTGVKRKTQKPVFEERYTFEIPFLEAQRRTLLL 

TVVDFDKFSRHCVIGKVSVPLCEVDLVKGGHW 

WKAHDSQFSAPGLPADQQFFADLFSGLVL>JPQL 

LGRVWFASQPASLPVGSLCIDFPRLDIVLRGEYG 

NLLEAKQQRLVEGEMLFIPARAANLPVNNKPVM 

LLSLVFAPTWLGLSFYDSRTTSLLHPARQIQLP\SL 

QRGEGEAMLSVALTLFSRSPLEQNIIQPLVLSLLHL 

CGSWNMPPGNSQPRGDFLYHSICTWVQDNYAQ 

PLTRESVAQFFNITPNHLSKLFAQHGTMRFIEYVR 

WVRMAKARMILQKYHLSIHEVAQRCGFPDSDYF 

CRVFRROFOMDYVDIT OTHRWDYNTPlFFTr FAT 

NDWKAGKARYIGASSMHASQFAQALELQKQH 

GWAQFVSMQDHYNLIYREEEREMLPLCYQEGV 

AVIPWSPLARGRLTRPWGETTARLVSDEVGKNL 

YKESDENDAQIAERLTGVSEELGATRAQVALAW 

LLSKPGIAAPIIGTSREEQLDELLNAVDITLKPEQI 

AELETPYKPHPVVGFK 


3468 


A 


147 


3209 


ALPLPLPTLYPGMSRRKQRKPQQLISDCEGPSASE 

NGDASEEDHPQVCAKCCAQFTDPTEFLAHQlSfAC 

STDPPVMVIIGGQENPNNSSASSEPRPEGPINNPQ 

VMDTEHSNPPDSGSSVPTDPTWGPERRGEESSGH 

FLVAATGTAAGGGGGLILASPKLGATPLPPESTP 

APPPPPPPPPPPGVGSGHLNIPLILEELRVLQQRQI 

HQMQMTEQICRQVLLLGSLGQTVGAPASPSELP 

GTGTASSTKPLLPLFSPIKPVQTSKTLASSSSSSSS 

SSGAETPKQAFFHLYHPLGSQHPFSAGGVGRSHK . 

PTPAPSPALPGSTDQLIASPHLAFPSTTGLLAAQC 

LGAARGLEATASPGLLKPKNGSGELSYGEVMGP 

LEKPGGRHKCRFCAKVFGSDSALQIHLRSHTGER 

PYKCNVCGNRFTTRGNLKVHFHRHREKYPHVQ 

MNPHPVPEHLDYVITSSGLPYGMSVPPEKj\EEEA 

ATPGGGVERKPLVASTTALSATESLTLLSTSAGT 

ATAPGLPAFNKFVLMKAVEPKNKADENTPPGSE 

GSAISGVAESSTATRMQLSKLVTSLPSWALLTNH 

FKSTGSFPLPLCARALGVASPSETSKLQQLVEKDD 

RQGAVAVTSAASGAPTTSAPAPSSSASSGPMQCV 

ICLRVLSCPRALRLHYGQHGGERPFKCKVCGRAF 

STRONLRAHFVGHKASPAARAQNSCPICQKKFT 
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SEQED 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

correspondiDg 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^'Alanine C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F°Phenylalanine, G=Glycine, H«Histidine, 
I<°IsoIeucine, K^Lysine, Lr=Leuclne, M=Mctbionine, 
N'sAsparagine, P»Prollne, Q^Glutaniine, R-Arginine, S^Serine, 
T^Threonine, V-Valine, W»Tryptophan, Y»Tyrosine, 
X>^J^known, *=Stop codon, /-possible nucleotide deletion, 
\«possible nucleotide insertion 










NAVTLQQHVRMHLGGQIPNGGTALPEGGGAAQ 
ENGSEQSTVSGAGSFPQQQSQQPSPEEELSEEEEE 
EDEEEEEDVTDEDSLAGRGSESGGEKAISVRGDS 
EEASO AFFFVGTVAAAATA GKFMDSNEKTTOOS 

SLPPPPPPDSLDQPQPMEQGSSGVLGGKEEGGKP 

ERSSSPASALTPEGEATSVTLVEELSLQEAMRKEP 

GESSSRKACEVCGQAFPSQAAL\EEH\QKTHPKEG 

PLIATCVFCRQGFLERATLKKHMLLAHHQVQPFA 

PHGPQNIAALSLVPGCSPSITSTGLSPFPRBCDDPTI 

P 


3469 


A 


3 

* 


5664 


NLRPLSFALFLGDPNMANLEESFPRGGTRKIHKP 

EKAFQQSVEQDNLFDISTEEGSTKRKKSQKGPAK 

TKKLKIEKRESSKSAREKFEILSVESLCEGMRILG 

CVKEVNELELVISLPNGLQGFVQVTEICDAYTKK 

LNEQVTQEQPLKDLLHLPELFSPGMLVRCVVSSL 

GITDRGKKSVKLSLNPKNVNRVLSAEALKPGML 

LTGTVSSLEDHGYLVDIGVDGTRAFLPLLKAQEY 

IRQKNKGAKLKVGQYLNCrVEKVKGNGGWSLS 

VGHSEVSTAIATEQQSWNLNNLLPGLVVKAQVQ 

KVTPFGLTLNFLTFFTGVVDFMHLDPKKAGTYFS 

NQAVRACILCVHPRTRWHLSLRPIFLQPGRPLTR 

LSCQNLGAVLDDVPVQGFFKKAGATFRLKDGVL 

AYARLSHLSDSKNVFNPEAFKPGNTHKCRIIDYS 

QMDELALLSLRTSIIEAQYLRYHDIEPGAVVKGT 

VLmSYGMLVKVGEQMRGLVPPMHLADILMK 

NPEKKYHIGDEVKCRVLLCDPEAKKLMMTLKKT 

LIESKXPVITCYADAKPGLQTHGFURVKDYGCIV 

KFYNNVQGLVPKHELSTEYIPDPERVFYTGQVV 

KVVVLNCEPSKERMLLSFKLSSDPEPKKEPAGHS 

QKKGKAINIGQLVDVKVLEKTKDGLEVAVLPHN 

IRAFLPTSHLSDHVANGPLLHHWLQAGDILHRVL 

CLSQSEGRVLLCRKPALVSTVEGGQDPK^FFSEIH 

PGMLLIGFVKSDKDYGVFIQLPSGLSGLAPKAIMS 

DKFVTSTSDHFVEGQTVAAKVTNVDEEKQRMLL 

SLRLSDCGLGDLAITSLLLLNQCLEELQGVRSLM 

SNRDSVLIQTLAEMTPGMFLDLVVQEVLEDGSV 

VFSGGPVPDLVLKASRYHRAGQEVESGQKKKVV 

ILNYDLLBCLEVHVSLHQ\DLV\NRKARKLRKGSE 

HQAIVQHLEKSFAIASLVETGHLAAFSLTSHLND 

TFRFDSEKLQVGQGVSLTLKilEPGVTGLLLAVE 

GPAAKRTMRPTQKDSETVDEDEEVDPALTVGTI 

KKHTLSIGDMVTGTVKSIKPTHVVVTLEDGIIGCI 

HASHILDDVPEGTSPTTKLKVGKTVTARVIGGRD 

MKTFKYLPISHPRFVRTIPELSVRPSELEDGHTAL 

NTHSVSPMEKIKQYQAGQTVTCFLKKYNVVKK 

WLEVEIAPDIRGRIPLLLTSLSFKVLKHPDKKPRV 

GQALRATVVGPDSSKITLCLSLTGPHKLEEGEVA 

MGRVYKVTPNEGLTVSFPFGKIGTVSIFHMSDSY 

SETPLEDFVPOKVVRCYILSTADNVLTLSLRSSRT 

NPETKSKVEDPEINSIQDIKEGQLLRGYVGSIQPH 

GVFFRLGPSVVGLARYSHVSQHSPSKKALYNKH 

LPEGKLLTARVLRLNHQKNLVELSFLPGDTGKPD 

VLSASLEGQL1KQEERKTEAEERDQKGEKKNQK 

RNEBCKNQKGQEEVEMPSKEKQQPQKPQAQKRG 

GRECRESGSEQERVSKKPKKAGLSEEDDSLVDV 
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SEQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, D='Aspartic Acid, 
E=Glutamic Acid, F«Pheny lata nine, G^Glycine, H^'Histidine, 
I<=I$o]eucine, K=Lysinc, L=Leucine, M»Metliionine, 
N^Asparagine, P'^Proline, Q»Glutamine, R=Aiginine, S»Serine, 
T»Tbreonine, V=Vallnc liV^Tryptoplian, Y=Tyrosine, 
X-Unknown, *»Stop codon, ^»possible nucleotide deletion, 
^possible nucleotide insertion 










YYEEGKEEAEETNVLPKEKQTKPAEAPRLQLSSG 

FAWNVGLDSLTPALPPLAESSDSEEDEKPHQATI 

KICSKKERELEKQKAEKELSRTEEALMDPGRQPE 

SADDFDRLVLSSPNSSILWLQYMAFHLQATEIEK 

ARAVAERALKTISFRJEEQEBCLNVWVALLNLENM 

YGSOESLTKVFERAVOYNEPLKVFLHLADrVAKS 

EKFQEAGELYNRMLKRFRQEKAVWIKYGAFLLR 

RSQAAASHRVLQRALECLPSKEHVDVIAKFAQL 

EFQLGDAERAKAIFENTLSTYPKRTDVWSVYID 

MTIKHGSQKDVRDIFERVIHLSLAPKRMKFFFKR 

YLDYEKQHGTEKDVQAVKAKALEYVEAKSSVL 

ED 


3470 


A 


2334 


1226 


TAAAPVAPGTMDDATVLRICKGYIVGINLGKGSY 
AKVKSAYSERLKFNVAVKIIARKKTPTDFVERFL 
PREMDILATVNHGSIIKTYEIFETSDGRIYIIMELG 

VQGDLLEFIKCQGALHEDVARKMFRQLSSAVKY 
CHDLDIVHRDT KCFNI LLDKnFMTKT.SrJFnF^KTl 

wXJU/IhmL'I V nfMrlfiTi* -P-ItI fl fl fl ■'f\l iT It 1 f%l nrl Fr ijfV P> 

CLRDSNGRIILSKTFCGSAAYAAPEVLQSIPYQPK 
VYDIWSLGVILYIMVCGSMPYDDSDIRKMLRIQK 
EHRVDFPRSKNLTCECKDLIYRMLQ\PDVS\KRLH 
FDEILSHS WLOPPKPKVATSS ASFKRFO FGIC YR AF 

CKLDTKTGLRPDHRPDHKLGAKTQHRLLVVPEN 
ENRMEDRLAETSRAKDHfflSGAEVGKAST 


3471 


A 


537 


148 


TERGAPQHPTLPLPSLTPSSVHTGQPKTTPSVILFL 
PSCEEPQANKATLVCLMNN/FYPGILMVTWKAD 
GTLITQSVEKTTPSKQSNNKYVASSYLSLTPEQW 
RSRRSYSCQVMQEGSTVEKSVAPAECS 


3472 


A 


1 


2272 


DKPTRHKTYLSSSWAKMAAAEGPVGDGELWQT 

WLPNHWFLRLREGLKNQSPTEAEKPASSSLPSS 

PPPQLLTRNWFGLGGELFLWDGEDSSFLWRLR 

GPSGGGEEPALSQYQRLLCINPPLFEIYQVLLSPT 

QHHVALIGIKGLMVLELPKRWGKNSEFEGGKST 

WCSTTPVAERFFTSSTSLTLKHAAWYPSEILbPH 

WLLTSDNVIRIYSLREPQTPTNVULSEAEEESLV 

LNKGRAYTASLGETAVAFDFGPLAAVPKTLFGQ 

NGKDEVVAYPLYILYENGETFLTYISLLHSPGN/I 

WKAVGSIAHASVAAEDNYGYDACAVLCLPCVPN 

ILVIATESGMLYHCVVLEGEEEDDHTSEKSWDSR 

IDLIPSLYVFECVELELALKLASGEDDPFDSDFSC 

PVKLHRDPKCPSRYHCTHEAGVHSVGLTWIHKL 

HKJLGSDEEDKDSLQELSTEQKCFVEHDLCTKPLP 

CRQPAPIRGFWIVPDILGPTMICITSTYECLIWPLL 

STVHPASPPLLCTREDVEVAESPLRVLAETPDSFE 

KHIRSILORSVANPAFLKASEKDIAPPPEECLOLLS 

RATQVFREQYILKQDLAKEEIQRRVKLLCDQKK 

KQLEDLSYCREERKSLREMAERLADKYEEAKEK 

QEDIMNRMKKLLHSFHSELPAn.SDSERDMKKEL 

QLIPDQLRHLGNAIKQVTMKKDYQQQKMEKVL 

SLPKPTIILSAYORKCIOSILKEEGEHIREMVKOrN 

DIRNHVNF 


3473 


A 


1 


2272 


DICPTRHKTYLSSSWAKMAAAEGPVGDGELWQT 

WLPNHVVFLRLREGLKNQSPTEAEKPASSSLPSS 

PPPQLLTRNVVFGLGGELFLWDGEDSSFLWRLR 

GPSGGGEEPALSQYQRLLCINPPLFEIYQVLLSPT 

QHHVALIGDCGLMVLELPKRWGKNSEFEGGKST 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

locatioD 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alanine C=CysteiDe, D=Aspartic Acid» 
E^GIutamic Add, F°Phenylalaaine, GsKSIycine, H«Histidine, 
I'^Isoleudne, K=Lysine, I>Leudne, MHMethionine, 
N^Aspanigine, psproline, Q^Glutamine, R=Arginine, S^Serine, 
T^Threonine, V=Va!lne, W=Tryptophan, Y=Tyrosinc, 
X=Unknown» *-Stop codon, /-possible nucleotide deletion, 
\-possible nudeotide insertion 










VKCSTTPVAERFFTSSTSLTLKHAAWYPSEILDPH 

VVLLTSDNVIRIYSLREPQTPTNVIILSEAEEESLV 

LNKGRAYTASLGETAVAFDFGPLAAVPKTLFGQ 

NGKDEWAYPLYILYENGETFLTYISLLHSPGN/I 

WKAVGSIAHAS\AAEDNYGYDACAVLCLPCVPN 

ILVIATESGMLYHCVVLEGEEEDDHTSEKSWDSR 

IDLPSLYVFECVELELALKLASGEDDPFDSDFSC 

PVKLPIRDPKCPSRYHCTHEAGVHSVGLTWIHKL 

HKPLGSDEEDKDSLQELSTEQKCFVEHILCTKPLP 

CRQPAPIRGFWIVPDILGPTMICITSTYECLIWPLL 

STVHPASPPLLCTREDVEVAESPLRVLAETPDSFE 

KHIRSILORSVANPAFLKASEKDIAPPPEECLOLLS 

RATQVFREQYILKQDLAKEEIQRRVKLLCDQKK 

KQLEDLSYCREERKSLREMAERLADKYEEAKEK 

QEDIMNRNDCKLLHSFHSELPVLSDSERDMKKEL 

QL1PDQLRHLGNAIKQVTMKKI)YQQQKMEKVL 

SLPKPTIILSAYQRKCIQSILKEEGEHIREMVKQIN 

DIRNHVNF 


3474 


A 


4344 


2550 


DRRREPERHVRVKQRTSVLNMLRRLDKIRFRGH 

KRDDFLDLAESPNASDTECSDEIPLKVPRTSPRDS 

EELRDPAGPGTLIMATGVQDFNRTEFDRLNEIKG 

HLEIALLEKHFLQEELRKLREETNAEMLRQELDR 

ERQRRMELEQKVQEVLKARTEEQMAQQPPKGQ 

AQASNGAERRSQGLSSRLQKWFYERFGEYVEDF 

RFQPEENTVETEEPLSARRLTENMRRLKRGAKPV 

TNF\^M.SALSDWYSWTSAIAFTVYMNAV\ra 

GWAJDPLFLFLAILRLSLNYLIARGWRIQWSIVPEV 

SEPVEPPKEDLTVSEKFQLVLDVAQKAQNLFGK 

MADILEKIKNLFMWVQPEITQKLYVALWAAFLA 

SCFFPYRLVGLAVGLYAGIKFFLIDFIFKRCPRLR 

AKYDTPYnWRSLPTDPQLKERSSAAVSRRLQTTS 

SRSYVPSAPAGLGBCEEDAGRFHSTKKGNFHEIFN 

LTEMERPLAVCENGWCCLINRDRKMPTDYIRN 

GVLYVT\ENYLCFESSKSGSSKRNKVIKLVDITDI 

QKYKVLSVLPGSGMGIAVSTPSTQKPLVFGAMV 

HRDEAFETILSQYIKITSAAASGGDS 


3475 


A 


2 


1126 


TAARRRQKGAAAAAETHGQAKAKSGWLKPYYF 

mLMESRKDITNQEELWKMKPRRNLEEDDYLHK 

DTGETSMLKRPVLLHLHQTAHADEFDCPSELQH 

TQELFPQWHLPIKIAAIIASLTFLYTLLREVIHPLA 

TSHQQYFYKIPILVINKVLPMVSITLLALVYLPGV 

lAAIVQLHNGTKYKKFPHWLDKWMLTJRKQFGL 

LSFFFAVLHAIYSLSYPMRRSYRYKLLNWAYQQ 

VQQNKEDALMEHDVWRMEIYVSLGIVGLAILAL 

LAVTSIPSVSDSLTWREFHYIQSKLGIVSLLLGTIH 

ALIFAVWKWIDIKQFVWYTPPTFMIAVFLPIVVLr 

FKSILFLPCLRKKILKIRHGWEDVTKINKTEICSQL 


3476 


A 


143 


3191 


AKAPPTGESSEPEAKVLHTKRLYRAVVEAVHRL 

DLILChnCTAYQEVFKPEMSLRNKLRELCVKLMF 

LHPVDYGRKAEELLWRKVYYEVIQLIKTNKKHI 

HSRSTLECAYRTHLVAGIGFYQHLLLYIQSHYQL 

ELQCCIDWTHVTDPLIGCKKPVSASGKEMDWAQ 

MACHRCLVYLGDLSRYQNELAGVDTELLAERFY 

YQALSVAPQIGMPFNQLGTLAGSKYYNVEAMY 

CYLRCIQSEVSFEGAYGNLKRLYDKAAKMYHQL 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

coirespondiog 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A«Alanine C<=Cysteine, INAspartic Acid, 
E^GIutamic Acid, F^Pbenylalanine, G=Glycine, H^Histidine, 
i=IsoIeucine, K^Lysine, L=Leucine, M=Methionine, 
N°Asparagine, P^Prollne, Q^Glutamine, R>Arginine, S=>Seriae, 
l^Threonine, V«Va!inc, W°Tryptophan, Y«Tyrosine, 
X-Unknown, *»Stop codon, A^possible nucleotide deletion, 
^possible nucleotide insertion 










KKCETRKLSPGKXRCKDIKRLLVNFMYLQSLLQ 

PKSSSVDSELTSLCQSVLEDFNLCLFYLPSSPNLS 

LASEDEEEYESGYAFLPDLLIFQMVIICLMCVHSL 

ERAGSKQYSAAIAFTLALFSHLVNHVNIRLQAEL 

EEGENPVPAFQSDGTDEPESKEPVEKEEEPDPEPP 

PVTPQVGEGRKSRKFSRLSCLRRRRHPPKVGDDS 

DLSEGFESDSSHDSARASEGSDSGSDKSLEGGGT 

AFDAETDSEMNSQESRSDLEDMEEEEGTRSPTLE 

PPRGRSEAPDSLNGPLGPSEASIASNLQAMSTQM 

FQTKRCFRLAPTFSNLLLQPTTNPHTSASHRPCV 

NGDVDKPSEPASEEGSESEGSESSGRSCRNERSIQ 

EKLQVLMAEGLLPAVKVFLDWLRTNPDLUVCA 

QSSQSLWNRLSVLLNLLPAAGELQESGLALCPEV 

QDLLEGCELPDLPSSLLLPEDMALRNLPPLRAAH 

RRFNFDTDRPLLSTLEESVVRICCIRSFGHFIARLQ 

QEEARR>IRLMRDMAQLRLQLEVSQLEGSLQQPK 

AQSAMSPYLVPDTQALCHHLPVIRQLATSGRFIVI 

IPRTVIDGLDLLKKEHPGARDGIRYLEAEFKKGN 

RYIRCQKEVGKSFERHKLKRQDADAWTLYKILD 

SCKQLT\LAQGAGEEDPSGMVTIITGLPLDNPSVL 

SGPMQAALQAAAHASVDIKNVLDFYKQWKEIG 


3477 


A 


1 


3902 


MTEPRERRGYSVPPRPEVGTQATEWRVEESNFN 

KIFLKKDAELGRSNHLPTWDKPEDASWLPQSCL 

GGDAVATTGEIHEEKAWKTRALEVGQPAQRDIR 

RGELWGKEHGADQAIQETLEDLSSLERTLVVSES 

SPLGGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQ 

ELGREERRRQAGAAFQVLQLPQALPIQVDSEEGL 

LSTGRRLDREQLCRQWDPCLVSFDVLATGDLALI 

HVEIQVLDINDHQPRPPKGEQELEISESASLRTRIP 

LDRALDPDTGPNTLHTYTLSPSEHFALDVIVGPD 

ETKHAELIVVKELDREIHSFFDLVLTAYDNGNPP 

KSGTSLVKVNVLDSNDNSPAFAESSLALEIQEDA 

APGTLLDCLTATDPDQGPNGEVEFFLSKHMPPEW 

LDTFSIDAKTGQVILRRPLDYEKNPAYEVDVQAR 

DLGPWIPAHCKVLIKVLDVNDNIPSIHVTWASQP 

SLVSEALPKDSFIALVMADDLDSGNNGLVHCWL 

SQELGHFRLKRTNGNTYMLLTNATLDREQWPK 

YTLTLLAQDQGLQPLSAKKQLSIQISDrNDNAPVF 

EKSRYEVSTRENNLPSLHLITIKAHDADLGINGK 

VSYRIQDSPVAHLVAIDSNTGEVTAQRSLNYEEM 

AGFEFQVIAEDSGQPMLASSVSVWVSLLDANDN 

APEVVQPVLSDGKASLSVLVNASTGHLLVPIETP 

NGLGPAGTDTPPLATHSSRPFLLTTIVARDADSG 

ANGEPLYSIRSGNEAHLFILNPHTGQLFVNVTNA 

SSLIGSEWELEIVVEDQGSPPLQTRALLRVMFVTS 

VDHLRDSARKPGALSMSMLTVICLAVLLGIFGLI 

LALFMSICRTEKKDNRAYNCREAESTYRQQPKR 

POKHIOKADIHLVPVLRGOAGEPCEVGOSHKDV 

DKEAMMEAGWDPCLQAPFHLTPTLYRTLRNQG 

NQGAPAESREVLQDTVNLLFNHPRQRj^ASREWL 

NLPEPQPATGQPRSRPLKVAGSPTGRLAGDQGSE 

EAPQRPPASSATLRRQRHLNGKVSPEKESGPRQI 

LRSLVRLSVAAFAERNPVEELTVDSPPVQQISQLL 

SLLHQGQFQPKPNHRGNKYLAKPGGSRSAIPDTD 
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SEQID 
NO: 


Method 

> 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to flrst amino 
acid residue of 
peptide 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
. acid residue of 
peptide 
sequence 


Amino acid sequence (A==Alanine OCysteine, INAspartic Acid, 
E=Glutaniic Acid, F==Phenylalanine, G~Glycine, H-Histidine, 
I=l80leuGine, K^Lysine, L^Leucine, M =Methionlne, 
N-Asparagine,P»ProUne, Q-Glutamine, R^Arginine, S^Serine, 
T=»Threonine, V-»Valine, W=Tryptophan, Y«^rosine, 
X='Unl(nown, *=Stop codon» A=possible nucleotide deletion, 
\=possible nucleotide insertion 










GPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEE 

LSSLLDPSTGLALDRLSAPDPAWMARLSLPLTTN 

YRDNVISPDAAATEEPRTFQTFGKAEAPELSPTG 

TRLASTFVSEMSSLLEMLLEQRSSMPVEAASEAL 

RBLSVCGRTLSLDLATSAASGMKVQGDPGGKTG 

TEGKSRGSSSSSRCL 


3478 


A 


13 

• 


1620 


TLPPPGNSGCHRLCFPEFEFLQVTKMEFSGRKWR 

KLRLAGDQRNASYPHCLQFYLQPPSENISLIEFEN 

LAIDRVKLLKSVENLGVSYVKGTEQYQSKLESEL 

RKLKFSYRENLEDEYEPRRRDHISHFILRLAYCQS 

EELRRWFIQQEMDLLRFRFSILPKDKIQDFLKDSQ 

LQFEAISDEEKTLREQEIVASSPSLSGLKLGFESIY 

KIPFADALDLFRGRKVYLEDGFAYVPLKDrVAIIL 

NEFRAKLSKALALTARSLPAVQSDERLQPLLNHL 

SHSYTGQDYSTQGNVGKISLDQIDLLSTKSFPPC 

MftOT AT RKNTHHI RtrGGRMOYGI FT KGTGT T 

LEQALQFWKQEFIKGKMDPDKFDKGYSYNIRHS 

FGKEGKRTDYTPFSCLKIBLSNPPSQGDYHGCPFR 

HSDPELLKQKLQSYKISPGGISQILDLVKGTHYQ 

V\ACQKYFEMIHTVDDCGFS\LSHPNQYFCESQRI 

LNGGKDIKKEPIQPETPQPKPSVQKTKDASSALA 


3479 


A 


698 


138 


RPELELWRLRSRSWRPLGVPRRCHRItNWKEPVR 

AQPLSVTVWAPRCQRP/QPPAPEPSSPNAAVPEAI 

PTPRAAASAALELPLGPAPVSVAPQAEAEARSTP 

GPAGSRLGPETFRQRFRQFRYQDAAGPREAFRQL 

REL/SPRQWLRPDI\RTKEQ\1VEMLVQEQLLAILP 

EAARARRIRRRTDVRITG 


3480 

■ 


A 

• 


117 


2226 
■ 


RRGSRSRGPFAEPAAPGGLCSSSEEKTEEGGMAV 

GLCKAMSQGLVTFRDVALDFSQEEWEWLKPSQ 

KDLYRDVMLENYRNLVWLGLSISKP>JMISLLEQ 

GKEPWMVERKMSQGHCADWESWWEIEELSPK 

WFIDEDEISQEMVMERLASHGLECS SFREA WKY 

KGEFELHQGNAERHFMQVTAVKEISTGKRDNEF 

SN/IWEKHTPEISIFN'ri'KS\PTIQQVHKFDIYDKLF 

PQNSVTIEYKRLHAEKESLIGNECEEFMQSTYLSK 

DIGIPPGEKPYESHDFSKLLSFHSLFTQHQTTHFG 

KLPHGYDECGDAFSCYSFFTQPQRIHSGEKPYAC 

NDCGKAFSHDFFLSEHQRTHIGEKPYECKECNKA 

FRQSAHLAQHQRJHTGEKPFACNECGKAFSRYAF 

LVEHQRIHTGEKPYECKECNKAFRQSAHLNQHQ 

RIHTGEKPYECNQCGKAFSRRIALTLHQRIHTGE 

KPFKCSECGKTFGYRSHLNOHORIHTGEKPYECI 

KCGKFFRTDSQLNRHHRIHTGERPFECSKCGKAF 

SDALVLIHHKRSHAGEKPYECNKCGICAFSCGSY 

LNQHQRIHTGEKPYECSECGKAFHQILSLRLHQRI 

HAGEKPYKCNESQRVFRSELAVSRGLTTKPADT 

GPDSTLNAAKVAEPARAGTEAALRPALSVAESA 

TSLGPLHQGRRFPEAPAAHPGGTGFTVCAS 


3481 


A 


2 


1522 


ASRHGMTPGALLMLLGALGPPLAPGVRGSEAEG 
RLREKLFSGYDSSVRPAREVGDRVRVSVGLILAQ 
LISLNEKDEEMSTKVYLDLEWTDYRLSWDPAEH 
DGIDSLRITAESVWLPDWLLNNM)GMFDVALDI 
SVVVSSDGSVRWQPPGIYRSSCSIQVTYFPFDWQ 
NCTMVFSSYSYDSSEVSLQTGLGPDGQGHQEIHI 
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SEQU) 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A«Alanine OCystelne, I>=Aspartic Acid, 
E^GIutamic Acid, F=Phenylalanine, G=Glycine, H=Hi5tidine, 
I=Isoleucine» K-Lysine, D=Leucine, M»Methionlne, 
N=A$paragine,P=Proline, Q»Glutamine, R-Arginine, S«=Seritte, 
T^Tbreonlne^ V=Valine, W«"Tiyptoplian, Y^Tyrosinc, 
X»Unknown, *=Stop codon, A=possible nucleotide deletioiit 
V'^Vossible nucleotide insertion 










HEGTFIENGQWENIHKPSRLIQPPGDPRGGREGQ 

RQEVIFYLIIRRKPLFYLVNVIAPCILITLLAIFVFY 

LPPDAGFKMGLSIFALLTLTVFLLLLADKVPETSL 

SVPIIIKYLlVIFITVrsn.VTFSVILSVVVLNLHHRSPH 

THQMPLW\^QIFIHKLPLYLRLKRPKPERDLMPE 

PPHCSSPGSGWGRGTDEYFIRKPPSDFLFPICPNRF 

QPELSAPDLRRFEDGPNRAVALLPELREVVSSISYI 

ARQLQEQEDHDALKEDWQFVAMWDRLFLWTF 

IIFTSVGTLWIFLDATYHLPPPDPFP 


3482 


A 


1273 


172 


ER>VDSGGADAEWYALADWTAVWLPRSDFYTR 

LQTGEGHVPALRLPAGMPPDSPRELVPKQAPCSP 

SDPALPWTLGHGNQPPAWPEPQGPMGPAGVAA 

RPGRFFGVYLLYCLNPRYRVRWYVGFTVNTARR 

VQQHNGGRKKGGA\GRTSGRGPWEMVLVVHGF 

Too V AAi-Jvr c W A W v^xlrxl/vaivivLiAri v uri\L#i\.oc i 

AFAFHLRVLAHMLRAPPWARLPLTLRAWRPDLR 

QDLCLPPPPHVLLAFGPPPAQVPRPQRERAGPFD 

DAEPEPDQGDPGACCSLCAQTIQDEEGPLCCPHP 

GCLLRAHVICLAEEFLQEEPGQLLPLEGQCPCCE 

KSLLWGDLIWLCQMDTEKEVEDSELEEAHWTD 

LLET 


3483 


A 


230 


3686 


WRPWPCIDTSWNLQVAARTLRVSSAQCGLVPT 

MARVESPVPAARASLTGSCVLGQAMPLRGGAGP 

SPASHGPTHGPSDPRTCLPGRGAGGMRPHGRGA 

LGCCGLCSFYTCHGAAGDEIMHQDIVPLCAADIQ 

DQLKKRFAYLSGGRGQDGSPVITFPDYPAFSEIPD 

KEFQNVMTYLTSIPSLQDAGIGFILVE)RRRDKW 

TSVKASVLRIAASFPANLQLVLVLRPTGFFQRTLS 

DIAFKFNRDDFKMKVPVIMLSSVPDLHGYIDKSQ 

LTEDLGGTLDYCHSRWLCQRTAIESFALMVKQT 

AQMLQSFGTELAETELPNDVQS-nSSVLCAHTEK 

KDKAKEDLRLALKEGHSVLESLRELQAEGSEPSV 

NQDQLDNQATVQRLLAQLNETEAAFDEFWAKH 

QQKLEQCLQLRHFEQGFREVKAILDAASQKIATF 

TDIGNSLAHVEHLLRDLANFQEKSGVFVERARA 

LSLTASSFIGNKHYAVDSIRPKCQELRHLCDQFSA 

EIARRRGLLSKSLELHRRLETSMKWCDEGIYLLA 

SQPVDKCQSQDGAEAALQEIEKFLETGAENKIQE 

LNAIYKEYESILNQDLMEHVRKVFQKQASMEEV 

FHRRQASLKKLAARQTRPVQPVAPRPEALAKSP 

CPSPGERRGSENSSSEGGALRRGPYRRAKSEMSES 

RQGRGSAGEEEESLAILRRHVMSELLDTERAYVE 

ELLCVLEGYAAEMDNPLMAHLLSTGLHNKKDV 

LFGNI^EIYHFHNRIFLRELENYTDCPELVGRCF 

LERMEDFQIYEKYCQNKPRSESLWRQCSDCPFFQ 

ECQRKLDHKLSLDSYLLBa^VQRTIKYQLLLKEM 

LKYSRNCEGAEDLQEALSSILGILKAYNDSMHLI 

AITGYDGNLGDLGKLLMQGSFSVWTDHKRGHT 

KVKELARFKPMORHLFLHEKAVLFCKKREENGE 

GYEKAPSYSYKQSLNMAAVGITENVKGDAKKFE 

IWYNAREEVYIVQAPTPEIKAAWVNEIRKVLTSQ 

LQACREASQHRALEQSQSLPLPAPTSrSPSRGNSR 

>IIKKLEERKTDPLSLEGYVSSAPLTKPPEKGKGW 

SKTSHSLEAPEDDGGWSSAEEQINSSDAEEDGGL 

GPKKLVPGKYTWADHEKGGPDALRVRSGDVV 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotiae 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
locattoo 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^Alanine C^ysteine, D^Aspartic Acid, 
E=GIutamlc Acid, F=Phenylalanine, G^GIycine, H^UIstidine, 
I=Isoleucine, K=^Lysine, L^Leucine, M^Metliionine, 
N°Asparagine, P-Proline, Q^GIutaniine, R^Arginlne, S=^rine, 
T^OIireonine, V»Valine, W«OrryptDphan, Y»l>rosine, 
X«Vnlcnown, *^top codon, A^possible nucleotide deletion, 
^possible nucleotide insertion 










ELVQEGDEGLW 


3484 


A 


208 


6103 


VTMAQQAADKYLYVDKNFINNPLAQADWAAK 

KLVWVPSDKSGFEPASLKEEVGEEATVELVENGK 

KVKVNKDDIQKMNPPKFSKVEDMAELTCLNEAS 

VLHNLKERYYSGLIYTYSGLFCWINPYKNLPIYS 

EEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQD 

REDQSILCTGESGAGKTENTKKVIQYLAYVASSH 

KSKKDQGELERQLLQANPILEAFGNAKTVKNDN 

SSRFGKFIRINFDVNGYIVGANIETYLLEKSRAIRQ 

AKEERTFHIFYYLLSGAGEHLKTDLLLEPYNKYR 

FLSNGHVTIPGQQDKDMFQETMEAMRIMGIPEEE 

QMGLLRVISGVLQLGNTVFKKERNTDQASMPDN 

TAAQKVSHLLGINVTDFTRGILTPRIKVGRDYVQ 

KAQTKEQADFAffiALAKATYERMFRWLVLRINK 

ALDKTKRQGASFIGILDIAGFEIFDLNSFEQLCINY 

TNEKLQQLFNHIMFILEQEEYQREGIEWNFIDFG 

LDLQPCIDLIEKPAGPPGILALLDEECWFPKATDK 

SFVEKVMQEQGTHPKFQKPKQLKDKADFCnHY 

AGKVDYKADEWLMKNMDPLNDNIATLLHQSSD 

KFVSELWKDVDRIIGLDQVAGMSETALPGAFKT 

RKGMFRTVGQLYKEQLAKLMATLRNTNPNFVR 

CIIPNHEKKAGKLDPHLVLDQLRCNGVLEGIRICR 

QGFPNRWFQEFRQRYEILTPNSIPKGFMDGKQA 

CVLMIKALELDSNLYRIGQSKVFFRAGVLAHLEE 

ERDLKITDVIIGFQACCRGYLARKAFAKRQQQLT 

AMKVLQRNCAAYLKLRNWQWWRLFTKVKPLL 

QVSRQEEEMMAKEEELVKVREKQLAAENRLTE 

METLQSQLMAEKLQLQEQLQAETELCAEAEBLR 

ARLTAK\KQ\ELEEICHDLEARVEEEEERCQHLQA 

EKKKMQQNIQELEEQLEEEESARQKLQLEKVTT 

EAKLKKLEEEQIILEDQNCKLAKEKKLLEDRIAEF 

TTNLTEEEEKSKSLAKLK3SIKHEAMITDLEERLRR 

EEKQRQELEKTRRKLEGDSTDLSDQ1AELQAQ\IA 

ELKMQLAKKEEELQAALARVEEEAAQKNMALK 

K1RELESQISELQEDLKCER\ASRNKAEKQKRDLG 

EELEALKTELEDTLDSTAAQQELRSKREQEVNIL 

KKTLEEEAKTHEAQIQEMRQKHSQAVEELAEQL 

EQTKRVKANLEKAKQTLENERGELANEVKVLLQ 

GKGDSEHKRKKVEAQLQELQVKFNEGERVRTEL 

ADKVTKLQVELDNVTGLLSQSDSKSSKLTKDFS 

ALESQLQDTQELLQEENRQKLSLSTKLKQVEDE 

KNSVFREQLEEEEEEAKHNLEKQIATLHAQVADM 

KKKMEDSVGCLETAEEVKRXLQKDLEGLSQRHE 

EKVAAYDKLEKTKTRLQQELDDLLVDLDHQRQ 

SACNLEKKQKKFDQLLAEEKTISAKYAEERDRA 

EAEAREKETKALSLARALEEAMEQBCAELERLNK 

QFRTEMEDLMSSKDDVGKSVHELEKSKRAIEQQ 

VEEMKTQLEELEDELQATEDAKLRLEVNLQAM 

KAQFERDLQGRDEQSEEKKKQLVRQVREMEAE 

LEDERKQRSMAVAARKKLEMDLKDLEAHTOSA 

NKNRDEAIKQLRKLQAQMKDCMRELDDIKASR 

EEILAQAKENEKKLKSMEAEMIQLQEELAAAER 

AKRQAQQERDELADEIANSSGKGALALEEKRRL 

EARIAQLEEEIJEEEQGNTELINDRLKKANLQIDQI 

NTDLNLERSHAQKNENARQQLERQNKELKVKL 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C'=Cysteine, D='Aspartic Acid, 
£=Glataniic Acid, F-^Phenylalanine, G'^Glycine, H»Histidine, 
I«Isoleucine, K-Lysine, L^Leuclne, M°Methioninc, 
N»Asparagine, P>=Proline, Q^GIutamine, RsArginine, S»5erinc, 
T^Threonine, V^Valine, W«Tryptophan, Y=Tyrosinc, 
X'=UnIcnown, *=Stop codon, /^possible nucleotide deletion, 
\Bpos5ible nucleotide insertion 


• 








QEMEGTVKSKYKASITALEAKIAQLEEQLDNETK 

ERQAACKQVF^TEKKLKDVLLQVDDERRNAEQ 

YKDQADKASTRLKQLKRQLEEAEEEAQRANASR 

RKLQRELEDATETADAMNREVSSLKNKLRRGDL 

PFWPRRMARKGAGDGSDEEVDGKADGAEAKP 


3485 


A 


2 


1782 


CSTGVSKAPLTYLMSYGFELGWRKGNRAVACR 

EDRGGESVGMGQESILSQVHWWEAEPVEKTPGR 

DSEATIMSLRVHTLPTLLGAVVRPGCRELLCLLM 

ITVTVGPGASGVCPTACICATDIVSCTNKNLSKVP 

GNLFRLIKRLDLSYNRIGLLDSEWIPVSFAKLNTL 

ILRHNNlTSISTGSFSriPNLKCLDLSSNKLKT^VK 

NAVFQELKVLEVLLLYNNHISYLDPSAFGGLSQL 

QKLYLSGNFLTQFPMDLYVGRFKLAELMFLDVS 

YNRIPSMPMHHINLVPGKQLRGryLHGNPFVCDX 

CSLVSLLVFWYRRHFSSVMDFKNDYTCRLWSDS 

RHSRQVLLLQDSFMNCSDSIINGSFRALGnHEAQ 

VGERLMVHCDSKTGNANTDFIWVGPDNRLLEPD 

KEMENFYVFHNGSLVIESPRFEDAGVYSCIAMNK 

QRLLNETVDVTINVSNFTVSRSHAHEAFNTAFTT 

LAACVASIVLVLLYLYLTPCPCKCKTKRQKNML 

HQSNAHSSILSPGPASDASADERKAGAGKRVVFL 

EPLKDTAAGQNGKVRLFPSEAVIAEGILKSTRGK 

SDSDSVNSVFSDTPFVAST 


34S6 


A 


357 


1173 


GDPRETKVFPSRSFARNTVGVSHHQSHLFHTVSR 
lYVEDKHKILYCEVPKAGCSNWKRILMVLNGLA 

SSAYNISHNAVHYGKHLKKLDSFDLKGIYTRLDT 

YTK\LVLVRDPMERLVSAFRDKFDHPNSYYHPVF 

GKAIIKKYRPNACEEALINGSGVKFKEFIHYLLDS 

HRPVGMDIHWEKVSKLCYPCLINYDFVGKFETL 

EEDANYFLQMIGAPICELKFPNFKDRHSSDERTNA 

QVVRQYLKDLTRTERQLIYDFYYLDYLMFNYTT 

PFL 


3487 


A 


2 


3281 


CDKSGAVPFSTTRSPRRPSPRSAGPSLSSVSPRSQ 

LWASSGLSEEHAAPLLPAWPRHPCPPSLTPGPSM 

AQGAMRFCSEGDCAISPPRCPRRWLPEGPVPQSP 

PASMYGSTGSLLRRVAGPGPRGRELGRVTAPCTP 

LRGPPSPRVAPSPWAPSSPTGQPPPGAQSSWIFR 

FVEKASVRPLNGLPAPGGLSRSWDLGGVSPPRPT 

PALGPGSNRKLRLEASTSDPLPARGGSALPGSRN 

LVHGPPAPPQVGADGLYSSLPNGLGDPPERLATL 

FGGPADTGFLNQGDTWSSPREVSSHAQRIARAK 

WEFFYGSLDPPSSGAKPPEQAPPSPPGVGSRQGS 

GVAVGRAAKYSETDLDTVPLRCYRETDIDEVLA 

EREEADSAIESQPSSEGPPGTAYPPAPRPGPLPGP 

HPSLGSGNEDEDDDEAGGEEDVDDEVFEASEGA 

RPGSRMPLKSPVPFLPGTSPSADGPDSFSCVFEAI 

LESHRAKGTSYTSLASLEALASPGPTQSPFFTFEL 

PPQPPAPRPDPPAPAPLAPLEPDSGTSSAADGPWT 

QRGEEEEAEARAKLAPGREPPSPCHSEDSLGLGA 

APLGSEPPLSQLVSDSDSELDSTERLALGSTDTLS 

NGQKADLEAAQRLAKRLYRLDGFRKADVARHL 

GKNNDFSKLVAGEYLKFFVFTGMTLDQALRVFL 

KELALMGETQERERVLAHFSQRYFQCNPEALSSE 

DGAHTLTCALMLLNTDLHGHNIGKRMTCGDnG 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

locatioo 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=»Aianine OCysteinc, I^Aspartic Acid, 
E^lutamic Acid, F=PbenylaIanine, G=Glycine, H=Histidlne, 
I«Isoleucine, K»Lysine, L=Leucine, M-Metliionine, 
N»Asparagine, P^Proline, Q^Iutamine, R=Arginine, S=Serine, 
T^'Tlireonine, V=Valine, W=Tryptophan, Y=Tyrosine, 
X-Unlcnown, *=Stop codon, /==possibIe nucleotide deletion, 
V»possible nucleotide insertion 










NLEGLNDGGDFPRELLKAL YS SIKNEKLQ WAIDE 

EELRRFLSELADPNPK VIKRISGGSGSGS SPFLDLT 

PEPGAAVYKHGALVRKVHADPDCRKTPRGKRG 

WKSFHGILKGMILYLQKEEYKPGKALSETELKN 

AISIHHALATRASVNYSKRPHVFYLRTADWRVFL 

FQAPSLEQMQSWITRINVVAAMFSAPPFPAAVSS 

QKKFSRPLLPSAATRLSQEEQVRTHEAKLKAMA 

SELREHRAAQLGKKGRGKEAEEQRQKEAYLEFE 

KSRYSTYAALLRVKLKAGSEELDAVEAALAQAG 

STEDGLPPSHSSPSLQPKPSSQPRAQRHSSEPRPG 

AGSGRRKP 


3488 


A 


441 


1968 


GTETPHCWGRGTAGLRRELDRJEERDGPGTATMS 

FPHFGHPYRGAFQFLVASASSSTTCCESTLRSVSY 

VASGSTPAPALCCAPNYDSRLLGSARPELGAALGI 

YGAPYAAAAAAQSYPGYLPYSPEPPSLYGALNP 

QYEFKEAAGSFTSSLAQPGAYYPYERTLGQYQY 

ERYGAVELSGAGRRKNATRETTSTLKAWLNEHR 

KNPYPTKGEKIMLAIITKMTLTQVSTWFANARRR 

LKKENKMTWAPKNKGGEERKAEGGEEDSLGCL 

TADTKEVTASQEARGLRLSDLEDLEEEEEEEEEA 

RLERRECGLAAPRFSFNDPSGSEEADFLSAETGSP 

RLTMHYPCLEKPRJWSLAHTATASAVEGAPPARP 

RPRSPECRMIPGQPPASARRLSVPRDSACDESSCI 

PKAFGNPKFALQGLPLNCAPCPRRSEPWQCQYP 

SGAEGSGPPAALGVSMQKTPTYRPARQLHTLCH 

SSLP 


3489 


A 


718 


2073 


L\AYHKALSYRGHVHANNRGrNNVHFTPPPSPS 

RGILPMNPRNMMNHSQVGQGIGIPSRTNSMSSSG 

LGSPNRSSPSIICMPKQQPSRQPFTVNSMSGFGMN 

RNQAFGMNNSLSSNIFNGTDGSENVTGLDLSDFP 

ALADRNRREGSGNPTPLINPLAGRAPYVGMVTK 

PANEQSQDFSIHNEDFPALPGSSYKDPTSSNDDSK 

SNLNTSGKTTSSTDGPKFPGDKSSTTQNNNQQKK 

GIOVLPDGRVTNIPOGMVTDOFGMIGLLTFIRAA 

ETDPGMVHLALGSDLTTLGLNLNSPENLYPKFAS 

PWASSPCRPQDIDFHVPSEYLTNIHIRDKLFFFFS 

W/TAEECLGRYGEDLLFYLYYMNGGDVLQLLAAV 

ELFNRDWRYHKEERVWITRAPGMEPTMKTNTY 

ERGTYYFFDCLNWRKVAKEFHLEYDKLEERPHL 

PSTFNYNPAQQAF 


3490 


A 


2 


2833 


FVAKMATSQYFDFAQGGGPQYSTQAPTLPLPTV 

GASYTGQPTPGMDPAVNPAFPPAAPAGYGGYQP 

HSGQDFAYGSRPQEPVPTATTMATYQDSYSYGQ 

SAAARSYEDRPYFQSAALQSGRMTAADSGQPGT 

QEACGQPSPHGSHSHAQPPQQAPIVESGQPASIL 

SSGYTYPTATGVQPESSASIVTSYPPPSYNPTCTA 

YTAPSYPNYDASVYSAASPFYPPAQPPPPPGPPQ 

OLPPPPAPAGSGSSPRADSKPPLPSKLPRPKAGPR 

QLQLHYCDICKISCAGPQTYREHLGGQKHRKKE 

AAQKTGVQPNGSPRGVQAQLHCDLCAVSCTGA 

DAYAAHIRGSKHQKVFKLHAKLGKPIPTLEPALA 

TESPPGAEAKPTSPTGPSVCASSRPALAKRPVASK 

ALCEGPPEPQAAGCRPQWGKPAQPKLEGPGAPT 

QGGSKEAPAGCSDAQPVGPEYVEEVFSDEGRVL 
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SEQID 
NO: 


Method 

• 


Predicted 

beginning 

nucleotide 

locatfon 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A-Alanlne C=Cysteine, D=Aspartic Acid, 
E^GIutamlc Acid, F^Phenylalanine, GM^Iycine, HsHistidine, 
I^Isoleucine, K^Lysine, Lr=Leucine, M=Metliionioe, 
N^Asparagine, P^ProIine, (MSIutamine, R=Arginine, S=Serine, 
l^hreonine, V^Vallne, W«Tryptophan, Y»Tyrosinc, 
X»Unknown, *»Stop codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion 










RFHCKLCECSFNDLNAKDLHWGRRHRLQYRKK 

VNPDLPIATEPSSRARKVLEERMRKQRHLAEERL 

EQLRRWHAERRRLEEEPPQDVPPHAPPDWAQPL 

LMGRPESPASAPLQPGRRPASSDDRHVMCKHAn 

YPTEQELLAVQRAVSHAERALKLVSDTLAEEDR 

GRREEEGDKRSSVAPQTRVLKGyMRVGn.AKGL 

LLRGDR]SrvnRLALLCSEKPTHSLLRRIAQQLPRQL 

QMVTEDEYEVSSDPEANIVISSCEEPRMQVTISVT 

AALRHARWFQARASGLQPCVIVIRVLRDLCRRV 

PTVWGALPAWAMELLVEKAVSSAAGPLGPGDAV 

RRVLECVATGTLLTDGPGLQDPCERDQTDALEP 

MTLQEREDVTASAQHALRMLAFRQTHKVLGMD 

LLPPRHRLGARFRKRQRGPGEGEEGAGEKKRGR 

RGGEGLV 


3491 


A 


2 

« 


1321 


FVGDGALSGCRRGRAPRVPSMAGSLPPCVVDCG 

TGYTKLGYAGNTEPQFIIPSCIAIRESAKWDQAQ 

RRVLRGVDDLDEFIGDEAIDKPTYATKWPIRHGII 

EDWDLMERFMEQWFKYLRAEPEDHYFLMTEP 

PLNTPENREYLAEIMFESFNVPGLYIAVQAVLAL 

AASWTSRQVGERTLTGIVIDSGDGVTHVIPVAEG 

WTnQPTVUTDTAnBTl'PrVPTr^r^T f PP1>17VriTDD17r%Q 
I V lOol^JJSJliirlAvJJUJl 1 Y r ll^t^l^LrKCivb V VJlr t^iiV;o 

LETAKAIKEKYCYICPDIVKEFAKYDVDPRKWIK 

QYTGINAINQKKFVIDVGYERPLGPEIFFHPEFAN 

PDFMESISDVYDEVIQNCPIDVRRPLYKNVVLSG 

GSTMFRDFGRRLQRDLKRVVDARLRLSEELSGG\ 

RIKPKPVEVQWTHHMQRYAV\WFGG\SMLASTP 

EFFQVCHTKKDYEEYGPSICRHNPVFGVMS 


3492 


A 


3 


2024 


PNGVALLHLPGAAVIPNTNYMFQDALGGRSRGS 

REESPAPSRAPASASLWRRLVVVEABCMAAHAAA 

AAQAAAAQAAHAEAADSWYLALLGFAEHFRTS 

SPPKIRLCVHCLQAVFPFKPPQRIEARTHLQLGSV 

LYHHTKNSEQARSHLEKAWLISQQIPQFEDVKFE 

AASLLSELYCQENSVDAAKPLLRKAIQISQQTPY 

WHCRLLFQLAQLHTLEKDLVSACDLLGVGAEY 

ARVVGSEYTRALFLLSKGMLLLMERKLQEVHPL 

LTLCGQrVENWQGNPIQKESLRVFFLVLQVTPiYL 

DAGQVKSVKPCLKQLQQCIQTISTLHDDEILPSNP 

ADLFHWLPKEHMCVLVYLVTVMHSMQAGYLE 

KAQKYTDKALMQLEKLKMLDCSPILSSFQVILLE 

HIIMCRLVTGHKATALQEISQVCQLCQQSPRLFS 

NHAAQLHTLLGLYCVSVNCMDNAEAQFTTALR 

LLERINPDHSFPVSSHCLRAAAFYVRGLFSFFQGR 

YNEAKRFLRETLKMSNAEDLNRLTACSLVLLGHI 

FYVLG>JHRESNNMVVPAMQLASKIPDMSVQLW 

SSALLRDLNKACGNAMDAHEAAQMHQNFSQQL 

LQDHIEACSLPEHNLITWTDGPPPVQFQAQNGPN 

TSLASLL 


3493 


A 


3 


2024 


PNGVALLHLPGAAVIPNTNYMFQDALGGRSRGS 

REESPAPSRAPASASLWRRLVVVEAKMAAHAAA 

AAQAAAAQAAHAEAADSWYLALLGFAEHFRTS 

SPPKIRLCVHCLQAVFPFKPPQRIEARTHLQLGSV 

LYHHTKNSEQARSHLEKAWLISQQIPQFEDVKFE 

AASLLSELYCQENSVDAAKPLLRKAIQISQQTPY 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C=Cystcine, D»Aspartic Acid, 
E=Glutnmic Acid, F»Phenylal&nlne, G=Glycine, H=Histidine, 
I^Isoleucine, K=Ly$inei L»Leucine, M'^Metliionine, 
N^Asparagine, P^Proline, Q=Glutaniine, R=Arginine, S=Serine, 
T^Threonine, V=VaUne, W«Tryptophan, Y^Tyrosine, 
X^Unlcnown, *=Stop codon, /-possible nucleotide deletion, 
V=po5Sil}le nodeotide insertion 






• 




WHCRLLFQLAQLHTLEKDLVSACDLLGVGAEY 

ARVVGSEYTRALFLLSKGMLLLMERKLQEVHPL 

LTLCGQIVENWQGKPIQKESLRVFFLVLQVTHYL 

DAGQVKSVKPCLKQLQQCIQTISTLHDDEILPSNP 

ADLFHWLPKEHMCVLVYLVTVMHSMQAGYLE 

KAQKYTDKALMQLEKLKMLDCSPILSSFQVILLE 

HIIMCRLVTGHKATALQEISQVCQLCQQSPRLFS 

NHAAQLHTLLGLYCVSVNCMDNAEAQFTTALR 

LTNHOELWAFlVTNLASVYIREGNRHOEVV\LYS 

LLERINPDHSFPVSSHCLRAAAFYVRGLFSFFQGR 

YNEAKRFLRETLKMSNAEDLNRLTACSLVLLGHI 

FYVLGNHRESNNMVVPAMQLASKIPDMSVQLW 

SSALLRDLNKACGNAMDAHEAAQMHQNFSQQL 

LQDHIEACSLPEHNLITWTDGPPPVQFQAQNGPN 

TSLASLL 


3494 


A 


2 


1615 


VLRGQRGPAGGLAEERRRGRNEWRIHDVTTAPF 

PGLVQRRSRLLIVSQVRYFLKNKVSPDLCNEDGL 

TALHQCCIDNFEEIVKLLLSHGANVNAKDNELW 

TPLHAAATCGHINLVKILVQYGADLLAVNSDGN 

MPYDLCEDEPTLDVmTCMAYQGITQEKlNEMRV 

APEQQMIADIHCMIAAGQDLDWIDAQGATLLHI 

AGANGYLRAAELLLDHGVRVDVKDWDGWEPL 

HAAAFWGQMQMAELLVSHGANNLNARTSMDE 

MPIDLCEEEEFKVLLLELK\HKHDVIMKSQLRHK 

SSLSRRTSHRQAS/SVGKVVRRTQPVGTGPNL\YR 

TDQENKDPNPRLEK\PVLLSEFPTKIPRGELDMPV 

ENGLRAPVSAYQYALANGDVWKVHEVPDYSM 

AYGNPGVADATPPWSSYKEQSPQTLLELKRQRA 

AAKLLSHPFLSTHLGSSMARTGESSSEGKAPLIG 

GRTSPYSSNGTSVYYTVTSGDPPLLKFKAPIEEM 

EEKVHGCCRIS 


3495 


A 


327 


1078 


APMADTTPNGPQGAGAVQFMIVITNKLDTAMWL 

SRLFTV YCS A I .FVT.PLLGLHF A A SFYORALL AN A 

LTSALRLHQRLPHFQLSRAFLAQALLEDSCHYLL 

YSLIFVNSYPVTMSBFPVLLFSLLHAATYTKKVLV 

DARG\SNSLPLLR\SVLDKLSANQQNILKFIACNEI 

FLMPATVFMLFSGQGSLLQPFIYYRFLTLRYSSRR 

NPYCRTLFNELRIWEHIIMKPACPLFVRRLCLQS 

lAFISRLAPTVP 


3496 


A 


3 


2867 


SSRTREMEEKEILRRQIRLLQGLIDDYKTLHGNAP 

APGTPAASGWQPPTYHSGRAFSARYPRPSRRGYS 

SHHGPSWRKKYSLVNRPPGPSDPPADHAVRPLH 

GARGGQPPVPQQHVLERQVQLSQGQNWIKVKP 

PSKSGSASASGAQRGSLEEFEDTPWSDQRPREGE 

GEPPRGQLQPSRPTRARGTCSVEDPLLVCQKEPG 

KPRMVKSVGSVGDSPREPRRTVSESVIAVKASFP 

SSALPPRTGVALGRKLGSHSVASCAPQLLGDRRV 

DAGHTDOPVPSGSVGGPARPASGPROAREASLV 

VTCRTNKFRKNNYKWVAASSKSPRVARRALSPR 

VAAENVCKASAGMANKVEKPQLIADPEPKPRKP 

ATSSKPGSAPSKYKWKASSPSASSSSSFRWQSEA 

GSKDHASQLSPVLSRSPSGDVRPALAHSGLKPLSG 

ETPLSAYKVKTRTKIIRRRGSTSLPGDKKSGTSPA 

ATAKSHLSLRRRQALRGKSSPVLKKTPNKGLVQ 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alanioe OCysteine, D^Aspartlc Add» 
E^Glutamic Acid, F^Phenyl alanine, G^Glycine, H=:Hlstidine, 
I=l5oleucine, K^'Lysine, L^Lcacine, M==MetfaioDine, 
N^Asparagine, P>=Proline, Q>=Glutamine, R-Arginihe, S>»Serine, 
T<=Threonine, V^Valine, W^^^ryptophan, Y^rosine, 
X^^UnlcnowD, *>=Stop codon, /=^ossible nocleotide deletion, 
\=possible nucleotide insertion 










VTKHRLCRLPPSRAHLPTKEASSLHAVRTAPTSK 

VIKTRYRTVKKTPASPLSAPPFPLSLPSWRARRLS 

LSRSLVLNRLRPVASGGGKAQPGSPWWRSKGYR 

CIGGVLYKVSANKLSKTSGQPSDAGSRPLLRTGR 

LDPAGSCSRSLASRAVQRSLAIIRQARQRREKRK 

EYCMYYNRFGRCNRGERCPYIHDPEKVAVCTRP 

VRGTCKKTDGTCPFSHHVSKEKMPVCSYFLKGI 
CSNSNCPYSHVYVSRKAEVCSDFLKGYCPLGAK 

CKKKHTLLCPDFARRGACPRGAQCQLLHRTQKR 

HSRRAATSPAPGPSDATARSRVSASHGPRKPSAS 

QRPTRQTPSSAALTAAAVAAPPHCPGGSASPSSS 

KASSSSSSSSSPPASLDHE\APSLQEAALAAACSN 

RLCKLPSFISLQSSPSPGAQPRVRAPRAPLTKDSG 

KPLHIBGPRL 


3497 


A 


1586 


141 

■ 


ATARDLGCARRIDRWMESTPSRGLNRVHLQCR 

NLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSL 

AKNWVMRMLFLEQPLPQAAVALWVKKEFSKA 

QEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQN 

LRIALLGGGKAWSDDTSQLGPDKHARDVPSLDK 

YAEERWEVVLHFMVGSPSAAVSQDLAQLLSQA 

GLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFM 

LQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYS 

VPfiA/f^DQT T "MFT OHT RFFHT VFDUK'Pl^^'RPVYP 
V jcvjiviOL^oj^jLfi^ r JUv^rxuivctr viju v r v^j\aajvi9 jvev i i it 

T/RALAINLSSGVSGAGGTVHQPGFIV\VETNYRL 

YAYTESELQIALIALFSEMLYPFPVNMVV\ARVTR\ 

ESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVL 

PPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDF 

ELLXLAHAPKLGVLVFE/NTPAKRLMWTPAGHS 

DVKRFWKROKHSS 


3498 


A 


790 


190 

• 


RDLGPAALMTASASSFSSSQGVQQPSIYSFSQITR 

SLFLSNGVAANDKLLLSSNRITAIVNASVGSGQRI 

LRG\LQYIKVPVTDARDSRLYDFFDPIADLIHTVS 

MRQGRTLLNCMAG\MSRSASLCLAYLMKYHSM 

SXLLDAHTWA/TKSRRPnRPNNGFWEQLINYEFK 

LFNNNTVRMINSPVGNIPDIYEKDLRMMISM 


3499 


A 


31 


1586 

• 


tagfllaplemqrlltpvkrilqltjiavqetslt 

parllpvahqrfstasavplaktdtwpkdvgil 

alevyfpaqyvdqtdlekynkveagkytvglg 

qtrmgfcsvqedinslcltvvqrlmeriqlpwd 

svgrlevgtetiidkskavktvlmelfqdsgntd 

legidttnacyggtaslfnaanwmessswdgry 

amvvcgdiavypsgnarptggagavamligpk: 

aplalerglrgthmenvydfykpnlaseypivd 

gklsiqcylraldrcytsyrkkiqnqwkqagsd 

rpftlddloymifhtpfckmvokslarlmfndf 

X M, I.H ^1 A XTXU XJL tk Mk X X^XXXTX w ^^J.^UXJ<U^JUXTXX X^ J^X 

lsassdtqtslykgleafgglkledtytnkdld 

kallkasqdmfdkktkaslylsthnghlmytssl 

ygclasllshhsaqelagsrigafsygsglaasf 

fsfrvsqdaapgspl\dklvsstsdlpkrlasrkc 

vspeefteimnqreqfyhkvnfsppgdtnslfpgt 

wylervdeqhrrkyarrpv 


3500 


A 


185 


2692 


mlptevpqshpgpsallllqlllpptsaffpniws 
llaapgsithqdlteeaalnvtlqlfleqpppgrp 
plrledflgrtlladdlfaayfgpgssrrfraal 
gevsranaaqdflptsrndpdlhfdaerlgqgr 
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SEQlD 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

toilrst ainiDO 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»A!anine C<<^ysteine, D^Aspartic Acid, 
E^'Glutamic Acid, F^Phenylalonine, G=Glycinc, H='Histidine, 
I"l8oleucine, KpLysine, Lr^Leucine, M=Methioniney 
N»Asparagine, P^ProIine, Q=Glutamine, R«Arglnine, S=Serine, 
T^Threonine, V=Valinc, W=Tryptophan, Y=Tyrosine, 
X'^Unknown, ^^^Stop codon, /=possibIe nucleotide deletion, 
\spossible nucleotide insertion 










ARLVGALRETVVAARALDHTLARQRLGAALHA 

LQDFYSHSNWVELGEQQPHPHLLWPRQELQNLA 

QVADPTCSDCEELSCPRNWLGFTLLTSGYFGTHP 

PKPPGKCSHGGHFDRSSSQPPRGGrNKDSTSPGFS 

PHHMLHLQAAKLALLASIQAFSLLRSRLGDRDFS 

RLLDITPASSLSFVLDTTGSMGEEINAAKIQARHL 

VEQRRGSPMEPVHYVLVPFHDPGFGPVFTTSDPD 

SFWQQLNEIHALGGGDEPEMCLSALQLALLHTPP 

LSDIFVF-rUASPKDAFLTNQVESLTQERRCRVTFL 

VTEDTSRVQGRARREILSPLRFEPYKAVALASGG 

EVIFTKDQHIRDVAAIVGESMAALVTLPLDPPW 

VPGQPLVFSVDGLLQFaTVRIHGDISSFWIKNPAG 

VSQGQEEGGGPLGHTRRFGQFWMVTMDDPPQT 

GTWEIQVTAEDTPGVRVQAQTSLDFLFHFGIPME 

DGPHPGLYPLTOPVAGLOTOLLVEVTGLGSRAN 

PGDPQPHFSHVILRGVPEGAELGQVPLEPVGPPE 

RGLLAASLSPTLLSTPRPFSLELIGQDAAGRRLHR 

AAPQPSTVVPVLLELSGPSGFLAPGSKVPLSLRIA 

SFSGPQDLDLRTFVNPSFSLTSNLSRAHLELNESA 

WGRLWLEVPDSAAPDSVVMVTVtAGGREANPV 

PPTHAFLRLLVSAPAPQDRH 


3501 


A 


1245 


5815 


RRAHPSHSRLSPYLSVSRDPYFFVTVSRTILTLSA 

PAPPRRTPAPSMGTALLQRGGCFLLCLSLLLLGC 

WAELGSGLEFPGAEGQWTRFPKWNACCESEMSF 

QLKTRSARGLVLYFDDEGFCDFLELILTRGGRLQ 

LSFSIFCAEPATLLADTPVNDGAWHSVRIRRQFR 

NTTIJTDQ\nEAKWVEVKSKRRDMTVFSGLFVGG 

LPPELRAAALKLTL AS VREREPFKG WIRD VRVNS 

SQVLPVDSGEVKLDDEPPNSGGG\SPCEAGEEGE 

GGVCLNGGVCSVVDDQAVCDCSRTGFRGKDCS 

QEDNNVEGLAHLMMGDQGKEEYIATFKGSEYF 

CYDLSQNPIQSSSDEITLSFKTLQRNGLMLHTGKS 

ADYVNLALKNGAVSLVINLGSGAFEALVEPVNG 

KFNDNAWHDVKVTRNLRQHSGIGHAMVTISVD 

GILTTTGYTQEDYTMLGSDDFFYVGGSPSTADLP 

GSPVSNNFMGCLKEVVYKNNDVRLELSRLAKQ 

GDPKMKIHGVVAFKCENVATLDPITFETPESFISL 

PKWNAKKTGSISFDFR'llEPNGLILFSHGKPRHQ 

KDAKHPQMIKVDFFAIEMLDGHLYLLLDMGSGT 

IKIKALLKKVNDGEWYHVDFQRDGRSGTISVNT 

LRTPYTAPGESEILDLDDELYLGGLPENKAGLVF 

PTEVWTALLNYGYVGCIRDLFIDGQSKDIRQMA 

EVQSTAGVKPSCSKETAKPCLSNPCKMNGMCRD 

GWNRYVCDCSGTGYLGRSCEREATVXSYDGSM 

FMKIQLPWMHTEAEDVSLRFRSQRAYGELMAT 

TSRDSADTLRLELDAGR VKLTVNLDCIRINCNS S 

KGPETLFAGYNLNDNEWHTVRVVRRGKSLKLT 

VDDQQAMTGQMAGDHTRLEFHNIErGUTERRY 

LSSVPSNFIGHLQSLTFNGMAYIDLCICNGDIDYC 

ELNARFGFRNIIADPVTFKTKSSYVALATLQAYT 

SMHLFFQFKTTSLDGLILYNSGDGNDFIWELVK 

GYLHYVFDLGNGANLIKGSSNKPLNDNQWHNV 

MISRDTSNLHTVKIDTKITTQITAGAJWLDLKSDL 

YIGGVAKETYKSLPKLVHAKEGFQGCLASVDLN 

GVRLPVDLISDGSFSCNGTDSRRGMWKGPSmCQ 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


A.niino acid sequence (A>°Alanine C=Cysteine, D=>A.spartic Acid, 
E^Glutamic Add, F^Fiienylalanlne, G=Clycine, H^Histidine, 
l^lsoleucine, K^Lysiae, L=Leucine, M^Methionine, 
N^'Asparagine, P=ProIine, Q=Glutamine, R-Arginine, Serine, 
T-Threonine, V'^Valine, 1iV=>Tryptophan, Y=Tyrosine, 
X^Unlcnown, ^'^Stop codon,/^^p<»slble nucleotide deletion, 
\=possible nucleotide insertion 










EDSCSNQGVCLQQWDGFSCDCSMTSFSGPLCND 

PGTTYIFSKGGGQITYKWPPNDRPSTRADRLAIGF 

STVQKEAVLVRVDSSSGLGDYLELHIHQGKIGVK 

FNVGTDDIAIEESNAIINDGKYHWRFTRSGGNA 

lXQ\n)SWPVIERYPAGRQLTIFNSQATIIIGGKEQ 

GOPFOGOLSGLYYNGLKVLNMAAENDANIArVG 

>rV^RLVGEVPSSMTTESTATAMQSEMSTSIMETTT 

TLATSTARRGKPPTKEPISQTTDDILVASAECPSD 

DEDIDPCEPSSGGLANPTRAGGREPYPGSAEVIRE 

SSSTTGlVrWGIVAAAALCILILLYAMYKYRNRDE 

GSYHVDESRNYISNSAQSNGAVVKEKQPSSAKSS 

NKNKKNKDKEYYV 


3502 


A 


394 


72 


KJPAHLPFTVIIMPKRKPSEGAMSDKVKA/KFELQ 

RRSAGLFSKPTPPKPETRPKKDPANQRQKLPKVR 

ICGKADA/SKEGNSPAEERCSMVQTQKVEGWRSG 

SELPVALSF 


3503 

I 


A 


43 


3358 


SGGRGPVRVRSEQLSPSAEQVSQISQISLGRRPLS 

SLPPPPSRALAPTRAPDTALTIMEVAEVESPLNPS 

CKIMTFRPSMEEFREFNKYLAYMESKGAHRAGL 

AKVIPPKEWKPRQCYDDIDNLLIPAPIQQMVTGQ 

SGLFTQYNIQKKAMTVKEFRQLANSGKYCTPRY 

LDYEDLERKYWKNLTFVAPIYGADINGSIYDEGV 

DEWNIARLNTVLDWEEECGISIEGVNTPYLYFG 

MWKTTFAWHTEDMDLYSINYLHFGEPKSWYAIP 

PEHGKRLERLAQGFFPSSSQGCDAFLRHKMTLIS 

PSVLKKYGIPFDKITQEAGEFMITFPYGYHAGFN 

HGFNCAESTNFATVRWIDYGKVAKLCTCRKDM 

VKISMDIFVRKFQPDRYQLWKQGKDr/TIDHTKP 

TPASTPEVKAWLQRRRKVRKASRSFQCARSTSK 

RPKADEEEEVSDEVDGAEVPNPDSVTDDLICVSE 

KSEAAVKLRNTEASSEEESSASRMQVEQNLSDHI 

KLSGNSCLSTSVTEDIKTEDDKAYAYRSVPSISSE 

ADDSIPLSTGYEKPEKSDPSELSWPKSPESCSSVA 

ESNGVLTEGEESDVESHGNGLEPGEIPAVPSGER 

NSFKVPSIAEGENKTSKSWRHPLSRPPARSPMTL 

VKQQAPSDEELPEVLSIEEEVEETESWAKPLIHL 

WQTfCPPNFAAEQEYNATVARMKPHCAICTLLMP 

YHKPDSSNEENDARWETFCLDEVVTSEGKTKPLIP 

EMCFIYSEENIEYSPPNAFLEEDGTSLLISCAKCC 

VRVHASCYGIPSHEICDGWLCARCKRNAWTAEC 

CLCNLRGGALKQTKNNKWAHVMCAVAVPEVR 

FTOVPERTQIDVGRIPLQRLKLKCIFCRHRVKRVS 

GACIOC S YGRCPASFHVTCAHAAG VLVMEPDD W 

PYWNITCFRHKVNPNVKSKACEKVISVGQTVIT 

KHRNTRYYSCRVMAVTSQTFYEVMFDDGSFSRD 

TFPEDIVSRDCLKLGPPAEGEVVQVKWPDGKLY 

GAKYFGSMAHMYQVEFEDGSQIAMKREDFVTL 

DEELPKRVKARFVSAGRCHLGTCQVNSLSSPHVS 

QAQQETYLGFWINSKKSQCNIFLSGTY 


3504 


A 


1124 


139 


RGEEQFDAEFRRPACLGFGERLQEFSRLLRAVHR 

SRAWTCYLAIRMLMATCCPSPTTTACTGPWQRA 

PPLRLLVQKREADSSGLAFASNSLQRRKKGLLLR 

PVAPLRTRPPLLISLPQDFRQVSSVIDVDLLPETH 

RRVRLHKHGSDRPLGFYIRDGMSVRVAPQG\LER 

VPGIFISRLVRGGLAESTGLLAVSDEILEVNGIEV 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^/Vianine C<=Cysteine) D»Aspartic Acid, 
E^lutamic Acid, F^Phenylalanine, &=Glycine, H-HistidiDe, 
I=Isoleucine, K— Lysine, L=Leucine, M'=Metliionine, 
N^Asparagine, P^Prolinc, Q^Glutamine, RnArginine, SsSerine, 
T-Threonine, V=VaIine, W»Tryptophan, Yx-Tyroslne, 
X-Unlcnoi;rn, *'=Stop codon, A^ossible nucleotide deletion, 
\=possible nucleotide insertion 










AGKTLNQVTDMMVANSHN\LIVTVKPANQRNN 
VVRGASGRLTGPPSAGPGPAEPDSDDDSSDLVIE 
NRQPPSSNGLSQGPPCWDLHFGCRHPGIRSSLPS 
LDDQEQASSGWGSRIRGDGSGFSL 


3505 


A 


3 


2898 


SCRSATSQSGCGGGRSWLCSSLKMAAQPPRGIRL 

SALCPKFLHTNSTSHTWPFSAVAELIDNAYDPDV 

NAKQIWlDKTVINDfflCLTFTDNGNGMTSDKLH 

KMLSFGFSDKVTMNGHVPVGLYGNGFKSGSM\R 

LGKDAIVFTKNGESMSVGLLSQTYLNEVIKAEHV 

VVPIVAFNKHRQMINLAESKASLAAILEHSLFSTE 

QKLLAELDAIIGKKGTRIIIWNLRSYKNATEFDFE 

KDKYDIRIPEDLDEITGKKGYKKQERMDQIAPES 

DYSLRAYCSILYLKPRMQIILRGQKVKTQLVSKS 

LAYIERDVYRPKFLSKTVRITFGFNCRNKDHYGI 

MMYHRNRLIKAYEKVGCQLRANNMGVGVVGn 

Ea^OO-KPTHNKQDFDYTNEYRLTITALGEKLND 

YWNEMKVKKKreYPLNLPVEDIQKRPDQTWVQ 

CDACLKWRiaPDGMIXJIPEKWYCSNNPVDPQFR 

NCEVPEEPEDEDLVHPTYEKTYKKTNKEKFRIRQ 

PEMIPRINAELLFRPT\ALSTPS\FSSPKESVSKR/RH 

LSEGTNSYATRLLNNHQVPPQSEPESNSLKRRLS 

TRSSILNAKNRRL\SSQF\ENSVYKG\DDDDEDVII 

LEENSTPKJPAYDHDIDMKSEQSHVEQGGVQVEF 

VGDSEPCGQTGSTSTSSSRCDQGNTAATQTEVPS 

LVVKKEETVEDEDDVRNDAVILPSCVEAEAKIHE 

TQETTDKSADDAGCQLQELRNQLLLVTEEKENY 

KROCHMFTDOIKVLOORILENlNDKYVKKFTrH 

QSTETDAVFLLESINGKSESPDHMVSQYQQALEE 

lERLKKQCSALQHVKAECSQCSNNESKSEMDEM 

AVQLDDVFRQLDKCSIERDQYKSEVELLEMEKS 

QIRSQCEELKTEVEQLKSTNQQTATDVSTSSNIEE 

SVNHMDGESLKLRSLRVNVGQLLAMIVPDLDLQ 

QVNYDVDWDEILGQWEQMSEISST 


3506 


A 


2 


2120 


RPPEAGGRYRAGGRRQAAKPSRPPLPSRRRLPQG 

GRTRRAMDRPAAAAAAGCEGGGGPNPGPAGGR 

RPPRAAGGATAGSRQPSVETLDSPTGSEVEWCK 

QLIAATISSQISGSVTSENVSRDYKALRDGNKLA 

QMEEAPLFPGESIKAIVKDVMYICPFMGAVSGTL 

TVTDFKLYFKNVERDPHFILDVPLGVISRVEKIGA 

QSHGDNSCGIEIVCBa)MRNLRLAYK\QEEQSKLG 

IFENLNKHAFPLSNGQALFAFSYKEKFPINGWKV 

YDPVSEYKRQGLPNESWKISKINSNYEFCDTYPA 

nWPTSVKDDDLSKVAVFLAKGRVPVLSWIHPE 

SQATITRCSQPLVGPNDKRCKEDEKYLQTIMDAN 

AQSHKLUFDARQNSVADTNKTKGGGYESESAYP 

NAELVFLEIH3OT1VN4RESLRKLKEIVYPSIDEARW 

LSNVDGTHWLEYIRMLLAGAVRIADKIESGKTSV 

VVHCSDGWDRTAOLTSLAMLMLDSYYRTIKGFE 

TLVEKJBWISFGHRFALRVGHGhDNHADADRSPIF 

LQFVDCVWQMTRQFPSAFEFNELFLinLDHLYS 

CLFGTFLCNCEQQRFKEDVYTKTISLWSYINSQL 

DEFSNPFFVNYENHVLYPVASLSHLELWVNYYV 

RWNPRMRPQMPIHQNLKELLAVRAELQKRVEG 

LQREVATRAVSSSSERGSSPSHFATSVHTLV 


3507 


A 


1 


2169 


GSSIKIRLTVLCAKNLAKKDFFRLPDPFVVKIVVD 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alanine OCysteine, D^Aspartic Acid, 
E=Glutaniic Acid, F^Phenylalanine, G=Glycine, H>=Histidine, 
Msoleucine, K'^Lysine, L»Leucine, M»Methionine, 
N=Asparagitte, P«FrolIne, Q^lutamine, R=Ai:ginine, S==Serioe, 
T'^OIireonine, V«Valine, W^Tryptophan, Y»Tyrosine, 
X»Unknown, *s:Stop codon,A=possible nucleotide deletion, 
\=possible nucleotide insertion 










GSGQCHSTDTVKNTLDPKWNQHYDLYVGICTDSI 

TISVWNHKKIHKKQGAGFLGCVRLLSNAISRLKD 

TGYQRLDLCKLNPSDTDAVRGQIVVSLQTRDRIG 

TGGSWDCRGLLENEGTVYEDSGPGRPLSCFME 

EPAPYTDSTGAAAGGGNCRFVESPSQDQRLQAQ 

RLRNPDVRGSLQTPQNRPHGHQSPELPEGYEQRT 

TVQGQVYFLHTQTGVSTWHDPRIPRDL>JSVNCD 

ELGPLPPGWEVRSTVSGRIYFVDHNNRTTQFTDP 

RLHHIMNHQCQLKJEPSQPLPLPSEGSLEDEELPA 

QRYERDLVQKLKVLRHELSLQQPQAGHCRIEVS 

REEIFEESYRQIMKMRPKDLKBCRLMVKFRGEEG 

LDYGGVAREWLYLLCHEMLNPYYGLFQYSIDNr 

YMLQINPDSSINPDHLSYFHFVGRIMGLAVFHGH 

YINGGFTVPFYKQLLGKPIQLSDLESVDPELHKSL 

G\RNVPVTEENKKEYVRLYVNWRFMRGIEAQFL 

ALQKGFNELIPQHLLKFFDQBCELELnOGLDKIDL 

NDWKSNTRLKHCVADSNIVRWFWQAVETFDEE 

RRARLLQFVTGSTRVPLQGFKALQGSTG\AAGPR 

LFTIHLIDANTDNLRKAHTCFNRIDIPPYESYEKL 

YEKLLTAVEETCGFAVE 


3508 


A 


3 


6388 


ILYINPADLGWNPPVSSWIEKREIQTBRANLTILF 

DKYLPTCLDTLRTRFKKIIPIPEQSMVQNIVCHLLE 

CLLTTEDIPADCPKEIYEHYFVFAAIWAFGGAMV 

QDQLVDYRAEFSKWWLTEFKTVKFPSQGTIFDY 

YIDPETKKFEPWSKLVPQFEFDPEMPLQACLVHT 

SETIRVCYFMERLMARQRPVMLVGTAGTGKSVL 

VGAKLASLDPEAYLVKNVPFNYYTTSAMLQAVL 

EKPLEKKAGRl^GPPGNKKLFYFroDMNMPEVD 

AYGTVQPHTIIRQHLDYGHWYDRSKLSLKEITNV 

QYVSCMNPTAGSFTINPRLQRHFSVFVLSFPGAD 

ALSSIYSIILTQHLKLGNFPASLQKSIPPLIDLALAF 

HQKIA rrFLPTGIKFHYIFNLRDFANIFQGILFSSV 

ECVKSTWDLIRLYLHESNRVYRDKMVEEKDFDL 

FDKIQTEVLKKTFDDIEDPVEQTQSPNLYCHFAN 

GIGEPKYMPVQSWELLTQTLVEALENHNEVNTV 

MDLVLFEDAMRHVCHINRILESPRGNALL VGVG 

GSGKQSLTRLAAFISSMDVFQITLRKGYQIQDFK 

MDLASLCLKAGVKNLNTVFLMTDAQVADERFL 

VLINDLLASGEIPDLYSDDEVENIISNVRNEVKSQ 

GLVDNRENCWKFFIDRIRRQLKVTLCFSPVGNKL 

RVRSRKFPAIVNCTAIHWFHEWPQQALESVSLRF 

LQNTEGIEPTVKQSISKFMAFVHTSVNQTSQSYLS 

NEQRYNYTTPKSFLEFIRLYQSLLHRHRKELKCK 

TERLENGLLKLHSTSAQVDDLKAKLAAQEVELK 

QKNEDADKLIQVVGVETDKVSREKAMADEEEQ 

KVAVIMLEVKQKQKDCEEDLAKAEPALTAAQA 

ALNTLNKTNLTELKSFGSPPLAVSNVSAAVMVL 

MAPRGRVPKDRSWKAAKVTMAKVDGFLDSLIN 

FNKENIHENCLKAIRPYLQDPEFNPEFVATKSYA 

AAGLCSWVINIVRFYEVFCDVEPKRQALNKATA 

DLTAAQEKLAAIKAKIAHLNENLAKLTABFEKA 

TADKLKCQQEAEVTAVnSLANRLVGGLASENV 

RWADAVQNFKQQERTLCGDILLlTAFISYLGFFr 

KKYRQSLLDRTWRPYLSQLKTPIPVTPALDPLRM 
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SEQO) 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (ABAianine C"Cysteine» I>=Aspartic Acld» 
£'=Clutaoiic Add, F^Fbenylalanine, G=Glycine, H=>HistidiDe, 
I^Isoieucine, K-Lyslne, L^LeucinC) M='Methionine, 
N=Asparagine» F=Prollne, Q=Glutanilne, R"Arginine, S==Serine, 
T^Threonine, V«Valine, W^Tryptophan, Y«Tyrosine, 
X^Unknown, ^'^Stop codon, /^possible nucleotide deletion, 
\=pos5il)le nucleotide insertion 










LMDDADVAAWQNEGLPADRMSVENATILINCE 

RWPLMVDPQLQGIKWIKNKYGEDLRVTQIGQKG 

YLQIIEQALEAGAVVLIENLEESIDPVLGPLLGRE 

VIKKGRFIKIGDKECEYNPKFRLILHTKLANPHYQ 

PELQAQATLINFTVTRDGLEDQLLAAVVSMERP 

DLEQLKSDLTKQQNGFKITLKTLEDSLLSRLSSA S 

GNFLGETVLVENLEITKQTAAEVEKKVQEAKVT 

EVXINEAREHYRPAAARASLLYFIMNDLSKIHPM 

YQFSLKAFSr/FQKAVERAAPDESLRERVANLID 

SITFSVYQYTIRGLFECDKLTYLAQLTFQILLMNR 

EVNAVELDFLLRSPVQTGTASPVEFLSHQAWGA 

VKVLSSMEEFSNLDRDIEGSAKSWKKFVESECPE 

KEKLPQEWKNKTALQRLCMLRAMRPDRMTYAL 

RDFVEEKLGSKYWGRALDFATSFEESGPATPMF 

FILSPGVDPLKDVESQGRKLGYTFNNQNFHNVSL 

GQGQEWAEAALDLAAKKGHWVILQNTLEMCS 

RETEFKSILFALCYFHAWAERRKFGPQGWNRSY 

PFNTGDLTISVNVLYNFLEANAKVPYDDLRYLFG 

EIMYGGHITDDWDRRLCRTYLGEFIRPEMLEGEL 

SLAPGFPLPGNMDYNGYHQYIDAELPPESPYLYG 

LHPNAEIGFLTQTSEKLFRTVLELQPRDSQARDG 

AGATREEKVKALLEEILERVTDEFNIPELMAKVE 

ERTPYIVVAFQECGRMNILTREIQRSLRELELGLK 

wEiL 1 M 1 orlJMLbNIjVlN AL I r UM V JrJbo W AK^ 

TAGLAAWFPDLLNRIKELEAWTGDFTMPSTVWL 

TGFFNPQSFLTAIMQSTARKNEWPLDQMALQCD 

MTKKNREEFRSPPREGAYIHGLFMEGACWDTQA 

GIITEAKLKDLTPPMPVMFIKAIPAD\RQDCGHVY 

SCPVTKTSQVRDPTYVWTFNLKTKENPSKWVLA 

GVALLLQI 


3509 


A 


3 


6388 


DLYINPADLGWNPPVSSWIEKREIQTERANLTILF 

DKYLPTCLDTLRTRFKKIIPIPEQSMVQMVCHLLE 

CLLTTEDIPADCPKEIYEHYFVFAAIWAFGGAMV 

QDQL\T)YRAEFSKWWLTEFKTVKFPSQGTIFDY 

YIDPETKKFEPWSKLVPQFEFDPEMPLQACLVHT 

SETIRVCYFMERLMARQRPVMLVGTAGTGKSVL 

VGAKLASLDPEAYLVKNVPFNYYTTSAMLQAVL 

EKPLEKKAGRNYGPPGNKKLIYFIDDMNMPEVD 

AYGTVQPHTIIRQHLDYGHWYDRSKLSLKEITNV 

QYVSCMNPTAGSFTINPRLQRHFSVFVLSFPGAD 

ALSSIYSnLTQHLKLGNFPASLQKSEPPLIDLALAF 

HQKIA ITFLPTGIKFHYIFNLRDFANIFQGILFSSV 

ECVKSTWDLIRLYLHESNRVYRDKMVEEKDFDL 

FDKIQTEVLKKTFDDIEDPVEQTQSPNLYCHFAN 

GIGEPKYMPVQSWELLTQTLVEALENHNEVNTV 

MDLVLFEDAMRHVCHINRILESPRGNALLVGVG 

GSGKQSLTRLAAFISSMDVFQITLRKGYQIQDFK 

MDLASLCLKAGVKNLNTVFLMTDAQVADERFL 

VLINDLLASGEEPDLYSDDEVENIISNVRNEVKSO 

GLVDNRENCWKFFIDRIRRQLKVTLCFSPVGNKL 

RVRSRKFPAIVNCTAIHWFHEWPQQALESVSLRF 

LQNTEGffiPTVKQSISKFMAFVHTSVNQTSQSYLS 

NEQRYNYTTPKSFLEFIRLYQSLLHRHRKELKCK 

TERLENGLLKLHSTSAQVDDLKAKLAAQEVELK 

QKNEDADKLIQVVGVETDKVSREKAMADEEEQ 
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S£QU> 
NO: 


Metbod 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucieotidc 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C^K^steine, D=^Aspartic Acid, 
E^Glutamic Add, F^Fhenylalanine, G^GIydne, H=Histidine, 
I=^Isoleucine, K^Lysine, L>»Leucine, M~Methionine, 
N==AsparagiRe« P^Proline, Q^GIutamine, RaArginine, S^erinei 
T»Threonine, V-Valine, W^Tryptophan, Y«Tyrosine, 
X»Unknown, *=Stop codon, /^possible nucleotide deletion, 
V^possible nudeotide insertion 

• 




• 






KVAVIMLEVKQKQKDCEEDLAKAEPALTAAQA 

ALNTLNKTNLTELKSFGSPPLAVSNVSAAVMVL 

MAPRGRVPKDRSWKAAKVTMAKVDGFLDSLIN 

FNKENIHENCLKAIRPYLQDPEFNPEFVATKSYA 

AAGLCSWVINIVRFYEVFCDVEPKRQALNKATA 

DLTAAQEKJJVAIKAKIAHLNENLAKLTARFEKA 

TADKLKCQQEAEVTAVTISLANRLVGGLASElSfV 

RWADAVQNFKQQERTLCGDILLITAFISYLGFFT 

KKYRQSLLDRTWRPYLSQLKTPIPVTPALDPLRM 

LMDDADVAAWQNEGLPADRMSVENATILINCE 

RWPLMVDPQLQGDCWIKNKYGEDLRVTQIGQKG 

YLQnEQALEAGAWLIENLEESIDPVLGPLLGRE 

VIKKGRFIBaGDKECEYNPKFRLILHTKLANPHYQ 

PELQAQATLINFTVTRDGLEDQLLAAWSMERP 

DLEQLKSDLTKQQNGFKITLKTLEDSLLSRLSSAS 

GNFLGETVLVENLEITKQTAAEVEKKVQEAKVT 

EVKTNEAREHYRPAAARASLLYHNfNDLSKIHPM 

YQFSLKAFSIVFQKAVERAAPDESLRERVANLID 

SITFSVYQYTIRGLFECDKLTYLAQLTFQILLMNR 

EVNAVELDFLLRSPVQTGTASPVEFLSHQAWGA 

VKVLSSMEEFSNLDRDIEGSAKSWKKFVESECPE 

KEKLPQEWKNKTALQRLCMLRAMRPDRMTYAL 

RDFVEEKLGSKYWGRALDFATSFEESGPATPMF 

FILSPGVDPLKDVESQGKKLGYTFNNQNFHNVSL 

GQGQEWAEAALDLAAKKGHWVILQNTLEMCS 

RETEFKSELFALCYFHAVVAERRKFGPQGWNRSY 

PFNTGDLTISVNVLYNFLEANAKVPYDDLRYLFG 

EIMYGGHITDDWDRRLCRTYLGEFIRPEMLEGEL 

SLAPGFPLPGNMDYNGYHQYIDAELPPESPYLYG 

LHPNAEIGFLTQTSEKLFRTVLELQPRDSQARDG 

AGATREEKVKALLEEILERVTDEFNIPELMAKVE 

ERTPYIWAFQECGRMNELTREIQRSLRELELGLK 

GELTMTSHMENLONALYFDMVPESWARRAYPS 

TAGLAAWFPDLLNRIKELEAWTGDFTMPSTVWL 

TGFFNPQSFLTAIMQSTARKNEWPLDQMALQCD 

MTKKNREEFRSPPREGAYIHGLFMEGACWDTQA 

GHIEAKLKDLTPPMPVMFIKAIPADVRQDCGHVY 

SCPVTKTSQ\RDPTYVWTFNLKTK£NPSKWVLA 

GVALLLQI 


3510 


A 


390 


3330 


AAGSGSRPPAPAARKMADLAECNIKVMCRFRPL 

NESEVNRGDKYIAKFQGEDTVVIASKPYAFDRVF 

QSSTSQEQVYNDCAKKIVKDVLEGYNGTIFAYG 

QTSSGKTHTMEGKLHDPEGMGIIPRIVQDIFNYIY 

SMDENLEFHIKVSYFEIYLDKIRDLLDVSKfNLSV 

HEDKNRVPYVKGCTERFVCSPDEVMDTIDEGKS 

NRHVAVTNMNEHSSRSHSIFLINVKQENTQTEQK 

LSGKLYLVDLAGSEKVSKTGAEGAVLDEAKNIN 

KSLSALGNVISALAEGSTYVPYRDSKMTRILQDS 

LGGNCRTTIVICCSPSSYNESETKSTLLFGQRAKTI 

KmYCVNVELTAEQWKKKYEKEKEKhfKILR>rn 

QWLENELNRWRNGETVPIDEQFDKEKANLEAFT 

VDKDITLTNDKPATAIGVIGNFTDAERRKCEEEIA 

KLYKQLDDKDEEINQQSQLVEKLKTQMLDQEEL 

LASTRRDQDNMQAELNRLQAENDASKEEVKEV 

LQALEELAVNYDQKSQEVEDKTKEYELLSDELN 
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SEQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

locatioji 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
correspoiHling 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C=Cystcinc, D^^Aspartic Acid, 
f>=Glutan)ic Add, F^Phenylalanine, G«Glydne, H-Histidine, 
l»Isoieucine, K^^Lysine, Lr^Leudnc, MaMetbionine, 
N^^Asparagine, P-Proline, Q^GIutamine, R=Arginine, S=<5criae, 
T^Threonine, V=Vallne, W^Tryptophan, Y=Tyrosine, 
X^Unfcnown, *»Stop codon, ^=possiblc nucleotide deletion, 
\»pQssible nucleotide insertion 




• 






QKSATLASIDAELQKLKEMThmQKKRAAEMMA 

SLLKDLAEIGIAVGNNDVKQPEGTGMIDEEFTVA 

RLYISKMKSEVKTMVIORCKQLESTQTESNKKME 

ENEKELAACQLRISQHEAKIKSLTEYLQNVEQKK 

RQLEESVDALSEELVQLRAQEKVHBMEKEHLNK 

VQTANEVKQAVEQQIQSHREIHQKQISSLRDEVE 

AKAKLITDLQDQNQKMMLEQERLRVEHEKLKA 

TDQEKSRKLHELTVMQDRREQARQDLKGLEETV 

AKELQTLHNLRKLFVQDLATRVKKSAEIDSVDDT 

GGSAAQKQKISFLENNLEXQLTKSAQTSWYRDNA 

DLRCELPKLEKRLRATAERVKALESALKEAKEN 

ASRDRKRYQQEVDRIKEAVRSKNMARRGHSAQI 

AKPIRPGQHPAASPTHPSAIRGGGAFVQNSQPVA 

VRGGGGKQV 


3511 


A 


1 


1757 


MASVQASRRQWCYLCDLPKMPWAMVWDFSEA 

VCRGCVNFEGADRIELLIDAARQLKRSHVLPEGR 

SPGPPALKHPATKDLAAAAAQGPQLPPPQAQPQP 

SGTGGGVSGQDRYDRATSSGRLPLPSPALEYTLG 

SRLANGLGREEAVAEGARRALLGSMPGLMPPGL 

LAAAVSGLGSRGLTLAPGLSPARPLFGSDFEKEK 

QQRNADCLAELNEAMRGRAEEWHGRPKAVREQ 

LLALSACAPFNVRFKKDHGLVGRVFAFDATARP 

PGYEFELKLFTEYPCGSGNVYAGVLAVARQMFH 

DALREPGKALASSGFKYLEYERRHGSGEWRQLG 

ELLTDGVRSFREPAPAEALPQQYPEPAPAALCGP 

PPRAPSRNLAPTPRRRKASPEPEGEAAGKMTTEE 

QQQRHWVAPGGPYSAETPGVPSPIAALKNVAEA 

LGHSPKDPGGGGGPVRAGGASPAASSTAQPPTQ 

HRLVARNGEAEVSPTAGAEAVSGGGSGTGATPG 

APLC\CTLCRERLEDTHFVQ\CPPVPEHKFCFPCSR 

KFEKAQGPAGEWYCPSGDKCPLVGSSVPWAFMQ 

GEIATILAGDIKVKKERDP 


3512 

• 


A 


3 


1994 


NTNSSSVTNSAAGVEDLNIVQVTVPDNEKERLSS 

lEKIKQLREQVNDLFSRKFGEAIGVDFPVKVPYR 

KITFNPGCWIDGMPPGWFKAPGYLEISSMRRIL 

EAAEFIKFTVIRPLPGLELSNGEYSTVGKRKIDQE 

GRVFQEKWERAYFFVEVQNISTCLICKRSMSVSK 

EYNLRRHYQTNHSKHYDQYMERMRDEKLHELK 

KGLRKYLLGLSDTECPEQKQVFANPSPTQKSPVQ 

PVEDLAGNLWEKLREKIRSFVAYSIAIDErTDINN 

TTQLAIFIRGVDENFDVSEELLDTVPMTGTKSGN 

EIFSRVEKSLKNFCINWSKLVSVASTGTPPMVDA 

NNGLVTKLKSRVATFCKGAELKSICCUHPESLCA 

Q\KLmDH\nVIDVVVKSVNWICSRGLNHSEFTTL 

LYELDSQYGSLLYYTEIKWLSRGLVLKRFFESLE 

EIDSFMSSRGKPLPQLSSIDWIRDLAFLVDMTMH 

LNALNISLQGHSQIVTQMYDLIRAFLAKLCLWET 

HLTRNNLAHFPTLKLVSRNESDGLNYPKIAELK 

TEFQKRLSDFKLYESELTLFSSPFSTKIDSVHEELQ 

MEVIDLQCNTVLKTKYDKVGIPEFYKYLWGSYP 

KYKHHCAKmSMFGSTYICEQLFSIMKLSKTKYC 

SQLKDSQWDSVLHIAT 


3513 


A 


1836 


513 


FKSLLSVKWFCFSILVLIFLGTRCYWEMTQSRPSP 
DPHRGRWEGGRSRPKGGEEGRRRTRVPGLVTAS 
GPGNPLPDRLGEMAGGRHRRWGTLHLLLLVAA 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C^ysteine, D"Aspartic Acid, 
E=Glutamic Add, F=Pbenylalanine, GKSIycine, H==Histidine, 
lBlsoIeticine» K<"Lysine, L»Leucine, M-Methiofilne, 
N°Asparagine, P^Proline, Q=Glutamine, R«Arginine, S=SeriDC, 
T°Ttireooine, V=Valinc W=Tryptophan, Y=Tyrosine, 
X=Unlmown, ^'^^top codon, ^possible nucleotide deletion, 
\ppossible nucleotide insertion 










LPWASRGVSPSASAWPEEKNYHQPAILNSSALRQ 

lAEGTSISEMWQNDLQPLLIERYPGSPGSYAARQ 

HIMQRIQRLQADWVLEIDTFLSQTPYGYRSFSNII 

STLNPTAKRHLVLACHYDSKYFSHWXNNRVFVG 

ATDSAVPCAMMLELARALDKKLLSLKTVSDSKP 

DLSLQLIFFDGEEAFLHWSPQDSLYGSRHLAAKM 

ASTPHPPGARGTSQLHGMDLLVLLDLIGAPNPTF 

PNFFPNSARWFERLQAIEHELHELGLLKDHSLEG 

RYFQNYSYGGVIQDDHIPFLRRGVPVLHLIPSPFP 

EVWHTMDDNEENLDESTIDNLNKILQVFVLEYL 

HL 


3514 


A 


1836 


513 


FKSLLSVKWFCFSILVLIFLGTRCYWEMTQSRPSP 

DPHRGRWEGGRSRPKGGEEGRRRTRVPGLVTAS 

GPGNPLPDRLGEMAGGRHRRWGTLHLLLLVAA 

LPWASRGVSPSASAWPEEKNYHQPAILNSSALRQ 

lAEGTSISEMWQNDLQPLLIERYPGSPGSYAARQ 

HIMQRIQRLQADWVLEIDTFLSQTPYGYRSFSNn 

STLNFTAKRHLVLACHYDSKYFSHWNNRVFVG 

ATDSAVPCAMMLELARALDKKLLSLKTVSDSKP 

DLSLQLIFFDGEEAFLHWSPQDSLYGSRHLAAKM 

ASTPHPPGARGTSQLHGMDLLVLLDLIGAPNPTF 

PNFFPNSARWFERLQAIEHELHELGLLKDHSLEG 

RYFQNYSYGGVIQDDHIPFLRRGVPVLHLIPSPFP 

EVWHTMDDNEENLDESTIDNLNKILQVFVLEYL 

HL 


3515 


A 


1 14 


754 


LCRDLTTTMSSKRTKTKTKKRPORATS>fVFAMF 

DQSQIQEFKEAFNMIDQNRDGFIDKEDLHDMLAS 

LGKNPTDEYLDAMMNEAPGPINFTMFLIWGEK 

LNGTDPEDVIRNAFACFDEEATGTIQEDYLRELL 

TT^GDRF\TDE\EVDELYREAP1\DKKGGIFNYI\E 

FTRHLETGGPKDKDDRKITFQIPSPNVPWLArFG 

VFLEEFLLHGP 


3516 


A 


1 


5169 


MAAAPSALLLLPPFPVLSTYRLQSRSRPSAPETDD 

SRVGGIMRGEKNYYFRGAAGDHGSCfTTTSPLA 

SALLMPSEAVSSSWSESGGGLSGGDEEDTRLLQL 

LRTARDPSEAFQALQAALPRRGGRLGFPRRKEAL 

YRALGRVLVEGGSDEKRLCLQLLSDVLRGQGEA 

GQLEEAFSLALLPQLWSLREENPALRXDALQIL 

HICLKRSPGEVLRTLIQQGLESTDARLRASTALLL 

PILLTTEDLLLGLDLTEVIISLARKLGDQETEEESE 

TAFSALQQIGERLGQDRFQSYISRLPSALRRHYN 

RRLESQFGSQVPYYLELEASGFPEDPLPCAVTLS 

NSNLKFGIIPQELHSRLLDQEDYKNRTQAVEELK 

QVLGKFNPSSTPHSSLVGFISLLYNLLDDSNFKVV 

HGTLEVLHLLVIRLGEQVQQFLGPVIAASVKVLA 

DNKLVIKQEYMKIFLKLMKEVGPQQVLCLLLEH 

LKHKHSRVREEVVNICICSLLTYPSEDFDLPKLSF 

DLAPALVDSKRRVRQAALEAFAVLASSMGSGKT 

SILFKAVDTVELQDNGDGVMNAVQARLARKTLP 

RLTEQGFVEYAVLMPSSAGGRSNHLAHGADTD 

WLLAGNRTQSAHCHCGDHVRDSNffnYGSYSPTI 

CTRRVLSAGKGKNKLPWENEQPGIMGENQTSTS 

mEQFSTYDFIPSAKLKLSQGMPVNDDLCFSRK 

RVSRNLFQNSRDFNPDCLPLCAAGTTGTHQTNLS 

GKCAQLGFSQICGKTGSVGSDLQFLGTTSSHQEK 
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SEQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alanine C^Cysteine, D^Aspartic Acid, 
EKSIutamic Acid, F=Ptienylalanine, G^GIycine, U«Histidine, 
I^lsoieucine, K«^Lysine, L^Leucine, M-Methionlne, 
N»Asparagine, P^Proline, Q^Glatamioey R«^rginioe,S=SeriQe, 
T-Threonine, V=VaUne, W=Tryptophan, Y=Tyroslne, 
X-UnI(nown, *^top codon, /^possible nucleotide deletion, 
V^possible nucleotide insertion 










VYASLNFGSKTQQTFGSQTECTSSNGQNPSPGAY 

ILPSYPVSSPRTSPKHTSPLIISPKKSQDNSVNFSNS 

WPLKSFEGLSKPKSHRRSLSAQKSS\DPTGR\NHG 

\ENSQEKPP\VQLTPAL\VRSPSSRRGLNGTKPVPPI 

PNRGISLLPDKADLSWGHKKKEPDDIWKCEKDS 

LPmLSELOTKDKDLDQEEMHSSLRSLRNSAAKK 

RAKLSGSTSDLESPDSAMKLDLTMDSPSLSSSPNI 

NSYSESGVYSQESLTSSLSrrPQGKRIMSDIFPTFG 

SKPCPTRLSSAKXKISHIAEQSPSAGSSSNPQQISS 

FDFTTTKALSEDSVVVVGKGVFGSLSSAPATCSQ 

SVISSVENGDTFSIKQSIEPPSGIYGRSVQQNISSYL 

DVENEKDAKVSISKSTYNKMRQKRKEEKELFHN 

KDCEICKEKNSWERMRHTGTEKMASESETPTGAI 

SQYKERMPSVTHSPEIMDLSELRPFSKPEIALTEA 

LRLLADEDWEKKIEGLNFIRCLAAFHSEILNTKL 

HETNFAWQEVKNLRSGVSRAAWCLSDLFTYL 

KKSMIXJELDTTVKVLLHKAGESNTFIREDVDKA 

LRAMVNNVTPARAVVSLINGGQRYYGRKMLFF 

MMCHPNFEKMLEKYVPSKDLPYIKDSVRNLQQK 

GLGEIPLDTPSAKGFOISHTGSVGNTRSSSVSRDA 

FNSAERAVTEVREVTRKSVPRNSLESAEYLKLIT 

KIFDAFKSRLHDSNSKVNLVALETMHKMIPLLRD 

HLSPIINMLIPAIVDNNLNSKNPGIYAAATKrVVQA 

LSQHVDNYLLLQPFCTKAQFLNGKAKQDMTEKL 

ADIVTELYQRKPHATEQKVLWLWHLLGNMTN 

SGSLPGAGGNIRTATAKLSKALFAQMGQ>rLLNQ 

AASQPPHIKKSLEELLDMTILNEL 


3517 


A 


1449 


252 


QDLKPVLDREYLAIYLKMVFFTCNACGESVKKI 

QVEKHVSVCRNCECLSCIDCGKDFWGDDYKNH 

VKCISEDQKYGGKGY/EKVKTJHKGD/ASKQQAW 

IQKISELIK\RPNVSPKVRELLEQISAFDNVPQ\KK 

AKFQNWMKNSLKVHNESILDQVWNIFSEASNSE 

PVNKEQDQRPLHPVANPHAEISTKVPASKVKDA 

NHQENSRNQKPKKRKKGQEADLEAGGEEVPEA 

NGSAGKRSKKKKQRKDSASEEEARVGAGKRKR 

RHSKVETDSKKKKMiaPEHPEGGEPEDDEAPAK 

GKFTWKGTIKAILKQAPDNEITIKKLRKKVLAQY 

YTVTDEHHRSEEELLVIFNKKISKKPTFKLLKDK 

VKLVK 


3518 


A 


3 


635 


APDSNARNDHFDACSLRVOAGLSSAGPALGMSn 

LAALMASPSKAVIVPGNGGGDVTTHGWYGWVK 

KELEKIPGFQCLAKNMPDPITARESIWLPFMETEL 

HCDEKTIIIGHSSGAIAAMRYAETHRVYAIVLVSA 

YTSDLGDENERASGYFTRPWQWEKIKANCPYIV 

QFGSTDDPFLPWKEQQEVAD\SWKPNCTNSLTV 

ATFRTQSFMN 


3519 


A 


81 


2277 


VRETRREMAMAMSDSGASRLRROLESGGFEARL 

YVKQLSQQSDGDRDLQEHRQRIQALAEETAQNL 

KRNVYQNYRQFIETAREISYLESEMYQLSHLLTE 

QKSSLESIPLTLLPAAAAAGAAAASGGEEGVGGA 

GGRDHLRGQAGFFSTPGGASRDGSGPGEEGKQR 

TLTTLLEKVEGCRHLLETPGQYLVYNGDLVEYD 

ADHMAQLQRVHGFLMNDCLLVATWLPQRRGM 
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SEQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=^Alantne C'=Cysteine, D=Aspartic Acid, 
E>«Glutamic Acid, F=Phenylalanine, G^Glycine, H-Histidine» 
I»Isoleucine, K-Lysine, LpLeucine, M»Methionine, 
N-Asparagine, P^Proline, Q^lutamine, R-Arginine, S=Serine, 
T^^reonine, V«VaIine, W^ryptophan, Y«Tyrosine, 
X»Un](nown, *»Stop codon, /spossible nucleotide deletion, 
^possible nucleotide insertion 










YRY1^^ALYSLDG1AVV^^VKDOTPMKDMFKLLMF 

PENRIFQAENAKIKREWLEVLEDTKRALSEKRRR 

EQEEAAAPRGPPQ VTSKATNPFBDDEEEEPA VPE 

VEEEKVDLSMEWIQELPEDLDVCIAQRDFEGAV 

DLLDKLNHYLEDKPSPPPVKELRAKVEERVRQL 

TEVLVFELSPDRSLRGGPKATRRAVSQLIRLGQC 

TKACELFLRNRAAAVHTAIRQLRIEGATLLYIHK 

LCHVFFTSLLETAREFEIDFAGTDSGCYSAFWW 

ARSAMGMFVDAFSKOVFDSKFSI STAAFCVKVA 

KEHCQQLGDIGLDLTFnHALLVKDIQGALHSYK 

EnffiATKHRNSEEMWRRMNLMTPEALGKLKEE 

MKSCGVSNFEQYTGDDCWVNLSYTVVAFTKQT 

MGFLEEALKLYFPELHMVLLESLVEIILVAVQHV 

DYSLRCEQDPEKKAFIRQNASFLYETVL\PVVEK 

RFEEGVGKPAKQLQDLRNASRLIRVNPESTTSVV 


3520 


A 


1706 


540 


FVAHLAWPWRADGDMEDGVLNEGFLVKRGHIV 

HNWKARWFILRQNILVYYKLEGGRRVTPPKGRI 

LLDGCTITCPCLEYENRPLLIKLKTQTSTEYFLEA 

CSREE/RRDAWAFEMTGAIHAGQARGKVQQLHS 

LRNSFKLPPfflSLHRIVDKMHDSNTGIRSSPNMEQ 

MLMEENFLRPVGVRSMGAIRSGDLAEQFLDDST 

ALYIPAESYKKKISPKEEISLSTVELSGTWKQGY 

LAKQGHKRKNWKVRRFVLRKDPAFLHYYDPSK 

EENRPVGGFSLRGSLVSALEDNGVPTGVKGNVQ 

GNLFKVrrK\DDTHYYIQA\SSKAE\RAE\WIGSLS 

BCSLNMNKDPEGTPDSLPSLPR 


3521 


A 


3 


3063 


HASVSLSLGCPRPCADTPGPQPQPMDLRVGQRPP 

VEPPPEPTLLALQRPQRLHHHLFLAGLQQQRSVE 

PMRVKMELPACGATLSLVPSLPAFSIPRHQSQSST 

PCPFLGCRPCPQLSMDTPMPELQEAPQEQELRQL 

LHKDKSKRSAVASSVVKQKLAEVILKKQQAALE 

RTVHPNSPGIPYRTLEPLETEGATRSMLSSFLPPV 

PSLPSDPPEHFPLRKTVSEPNLKLRYKPKKSLERR 

KNPLLRKESAPPSLRRRPAETLGDSSPSSSSTPAS 

GCSSPNDSEHGPNPILGSEALLGQRLRLQETSVAP 

FALPTVSLLPAITLGLPAPARADSDRRTHPTLGPR 

GPILGSPHTPLFLPHGLEPEAGGTLPSRLQPILLLD 

PSGSHAPLLTVPGLGPLPFHFAQSLMTTERLSGSG 

LHWPLSRTRSEPLPPSATAPPPPGPMQPRLEQLKT 

HVQVIKRSAKPSEKPRLRQIPSAEDLETDGGGPG 

QWDDGLEHRELGHGQPEARGPAPLQQHPQVLL 

WEQQRLAGRLPRGSTGDTVLLPLAQGGHRPLSR 

AQSSPAAPASLSAPEPASQARVLSSSETPARTLPF 

TTGLIYDSVMLKHQCSCGDNSRHPEHAGRIQSIW 

SRLQERGLRSQCECLRGRKASLEELQSVHSEEIHV 

LLYGTNTPLSRLKLDNGKLAGLLAQRMFVMLPCG 

GVGVDTDTIWNELHSSNAARWAAGSVTDLAFK 

VASRELKNGFAWRPPGHHADHSTAMGFCFFNS 

VAIACRQLQQQSKASKILIVDWDVHHGNGTQQT 

FYQDPSVLYISLHRHDDGNFFPGSGAVDEVGAGS 

GEGFNVNVAWAGGLDPPMGDPEYLAAFRIWM 

PIAREFSPDLVLVSAGFDAAEGHPAPLGGYHVSA 

KCFGYMTQQLMNLAGGAVVLALEGGHDLTAIC 

DASEACVAALLGNRVDPLSEEGWKQKPNLNAIR 
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SEQ ID 
NO; 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
oucleotide [ 
lociitiDn 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine OCysteine, D=Aspartic Add, 
EKSlutamic Add, F«Pbenylalanine, G=Glycine, H=Histidine, 
I»Isoleucine, K^Lysine, IHLeudne, M^Metlilonine, 
N=Asparagine, P»Proline, Q»Glutamiiie, R»Arginine, S^Serine, 
T=ThreoBine, V=VaIine, W«Tryptoplian, Y=Tyrosine, 
X=Unknown,*^top codon, /-possible nucleotide deletion^ 
\spossible nudeotide insertion 










SLEAWIRVHSKYWGCMQRLASCPDSWVPRVPG 
ADKEEVEAVTALASLSVGILAEDRPSEQLVEEEE 
PMNL 


3522 


A 


9 


602 


KMAALGEPVRLERDICRAIELLEKLQRSGEVPPQ 

KLQALQRVLQSEFCNAVREVYEHVYETVDISSSP 

EVRANATAKATVAAFAASEGHSHPRVVELPKTE 

EGLGFNIMGGKEQNSPIYISRIIP/GGIADRHGGLK 

RGDQLLSVNGVSVEGEHHEKAVELLKAAQGKV 

KLVVRYTPKVLEEMESRFEKMRSAKRRQQT 


3523 


A 


645 


1465 


IMAETSLLEAGASAASTAAALENLQVEASCSVCL 

EYLKEPVIffiCGHNFCKACITRWWEDLERDFPCP 

VCRKTSRYRSLRPNROLGSMVEIAKOLVRPSSGRS 

GMRASAPQHHEALSLFCYEDQEAVCLICAISHTH 

RAHTVVPLDDATQEYKEKLQKCLEA\LNQKLQEI 

TRCKSSEEKKPGELBCRLVESRRQQILREFEELHRR 

LDEEQQVLLSRLEEEEQDELQRLRENAAHLGDKR 

RDLAHLAAEVEGKCLQSGFEMLKVRPLPLHSPS 

G 


3524 


A 


3 


698 


PMVRHEAGEALGAIGDPEVLEILKOYSSDPVIEV 

AETCQLAVRRLEWLQQHGGEPAAGPYLSVDPAP 

PAEERNDVGRLREALLDESRPLFERYRAMFALRN 

AGGEEAALALAEGLHCGSALFRHEVGYVLGQLQ 

HEAAVPQLAAALARCTENPMVRHECAEALGAIA 

RPACLAALQAHADDPERVVRE\SCKVALDMYEH 

ETGRAFQYADGLEQLRGAPSLGPNPHPELPEDS 


3525 


A 


1452 


694 


EGLQRPEYLVASAAGFQGLAWGGEGRGRAGCS 
SSGFRDAEPLLLSCPGRNEPLKICERLK WK '^D YP 

MTDGQLRSKRDEFWDTAPAFEGRKEIWDALKA 

AAYAAEANDHELAQAILDGASITLPHGTLCECY 

DELGNRYQLPIYCLSPPVNLLLEHTEEESLEPPEP 

PPSVRREFPLKVRLSTGKDVRLSASLPDTVGQLK 

RQLHAQE/GTPKPSWQRWFFSGKLLTDRTRLQET 

KIQKDFVIQVnNQPPPPQD 


3526 


A 


123 


3441 


PGNEGLGLAADHNEDLGHLSADAPWPAVTMAP 

RKRSHHGLGFLCCFGGSDIPEINLRDNHPLQFME 

FSSPIPNAEELNIRFAELVDELDLTDICNREAMFAL 

PPEKKWQIYCSKKKEQEDPNKLATSWPDYYIDRI 

NSMAAMQSLYAFDEEETEMRNQWEDLKTALR 

TQPMEO^VTRFIELEGLTCLLNFLRSMDHATCESRI 

HTSLIGCIIALMNNSQGRAHVLAQPEAISTIAQSL 

RTENSKTKVAVLEILGAVCLVPGGHKKVLQAML 

HYQVYAAERTRFQTLLNELDRSLGRYRDEVNLK 

TAIMSFINAVLNAGAGEDNLEFRLHLRYEFLMLG 

IQPVIDKLRQHENAILDKHLDFFEMVRNEDDLEL 

ARRFDMVHIDTKSASQMFELIHKKLKYTEAYPC 

LLSVLHHCLQMPYKRNGGYFQQWQLLDRILQQI 

VLQDERGVDPDLAPLENFNVKNIVNMLINENEV 

KQWRDQAEKFRKEHMELVSRLERKERECETKTL 

EKEEMMRTVLNKMKDKLARESQELRQARGQVA 

ELVAQLSELSTGPVSSPPPPGGPLTLSSSMTTNDL 

PPPPPPLPFACCPPPPPPPLPPGGPPTPPGAPPCLG 

MGLPLPQDPYPSSDVPLRKKRVPQPSHPLKSFNW 

VKLNEERVPGTVWNEIDDMQVFRILDLEDFEKM 

FSAYQRHQELITNPSQQKELGSTEDIYLASRKVK 

ELSVDDGRRAQNCnLLSKLKLSNEEIRQAILKMD 
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SEQlD 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
'sequence 


Amino acid sequence (A^Alanine C=Cysteine, D='Aspartic Acid, 
E«Glutanilc Acid, F^Phenyl alanine, G«Glycine, H==Histidine, 
I»Isoleucine, K'=>Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=*Proline, Q=Glutamine, R=Arginine, SsSerine, 
T-Tbreoninc, V=Valinc, W«Tryptophan, Y»Tyrosioe, 
X'^Unknoivn, *«Stop codon, /^possible nucleotide deletion, 
\Fpossible nucleotide insertioD 










EQEDLAKDMLEQLLKFIPEKSDIDLLEEHKHEIER 

MARADRFLYEMSRIDHYQQRLQALFFKKKEQER 

LAEAKPKVEAILLASRELVRSKRLRQMLEVILAI 

GNFMNKGQRGGAYGFRVASLNKIADTKSSIDRN 

ISLLHYLIMILEKHFPDD^NMPSELOHLPEAAKVN 

LAELEKEVGNLRRGLRAVEVELEYQRRQVREPS 

DKFVPVMSDFirVSSFSFSELEDQLNEARDKFAK 

ALMHFGEHDSKMQPDEFFGIFDTFLQAFSEARQD 

LEAMRRRKEEEERRARMEAMLKEQRERERWQR 

QRKVLAAGSSLEEGGEFDDLVSALRSGEVFDKD 

LCKLKRSRKRSGSQALEVTRERAINRLNY 


3527 


A 


1445 


714 


LLGTRMLAGQLEARDPKEGTHPEDPCPGAGAV 

LGRQKSLWADVHGKLRSHIDALREQNMELREKL 

RALQLQRWKARKKSAASPHAGQESHTLALEPAF 

GKISPLSADEETEPKYAGHKIAQSGHSSWGQRSSS 

NNSAPPKPMSLKIERISSWKTPPQENRDKNLSRR 

RQDRRATPTGRPTPCAERRGWSEDGKVASDTCV 

TLHWPLGKFRFR 


3528 


A 


484 


1777 


RISKIQVYYSTGYSSRKMNPTLGLAIFLAVLLTVK 

GLLKPSFSPRNYKALSEVQGWKQRMAAKELAR 

QNMDLGFKLLKKLAFYNPGRNIFLSPLSISTAFS 

MLCLGAQDSTLDEIKQGFNFRKMPEKDLHEGFH 

YIIHELTQKTQDLKLSIGNltFIDQRLQPQRKFLE 

DAKI^SAETILTNFQNLEMAQKQINDFI/ESKTH 

fllfTMMT TPTsITnPfrTVMT T AMVT17KR APWVtTPTrriP 

NVTKEEDFFLEKNSSVKVPMMFRSGIYQVGYDD 

KLSCTILEIPYQKNITAIFILPDEGKLKHLEKGLQV 

DTFSRWKTLLSRRVVDVSVPRLHMTGTFDLKKT 

LSYIGVSKIFEEHGDLTKIAPHRSLKVGEAVNKA 

ELKMDERGTEGAAGTGAQTLPMETPLVVKIDKP 

YLLLIYSEKIPSVLFLGKIVNPIGK 


3529 


A 


1 


5684 


VSSVSHENPTEVFEDGENPPSSRSSESGFTEFIQY 

QADRTDDIDRELSEGQGAAAIPIGSTSSETETAST 

VGSEETIIQTPSVVTQGTATRSRKTAQKTAMQCC 

LEYVQQFLTRLINLYnQNNSFSQSLATEHQGDLG 

REQGETSKWDRNSQGDVKEKNISKQKTSKEYLS 

AFLAACQLFLECSSFPVYIAEGNHTSELRSEKLET 

DCEHVQPPQWLQTLMNACSQASDFSVQSVAISL 

VMDLVGLTQSVAMVTGENINSVEPAQPLSPNQG 

RVAWIRPPLTQGNLRYIAEKTEFFKHVALTLWD 

QLGDGTPQHHQKSVELFYQLHNLVPSSSICEDVI 

SQQLTHKDKKIRMEAHAKFAVLWHLTRDLHINK 

SSSFVRSFDRSLFIMLDSLNSLDGSTSSVGQAWL 

NQVLQRHDIARVLEPLLLLLLHPKTQRVSVQRV 

QAERYWNKSPCYPGEESDKHFMQNFACSNVSQ 

VQLITSKGNGEKPLTMDEDSNFSLTVNPLSDRLSL 

LSTSSEnPMVVSDFDLPDQQBEILQSSDSGCSQSS 

AGDNLSYEVDPETVNAOEDSOMPKESSPDDDVO 

QWFDLICKWSGLEVESASVTSQLEIEAMPPKC 

SDIDPDEETIKIEDDSIQQSQNALLSNESSQFLSVS 

AEGGHECVANGISRNSSSPCISGTTHTLHDSSVAS 

lETKSRQRSHSSIQFSFKEKLSEKVSEKETIVKESG 

KQPGAKPKVKLARKKDDDKKKSSNEKLKQTSV 

FFSDGLDLENWYSCGEGDISEIESDMGSPGSRKSP 
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S£Q0> 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

correspondiBg 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanioe C'^Cysteine, D=Aspartic Add, 
E'Glntamic Add, F"=Phenylalanine, G^^Glycine, H^Histidine, 
I°Isoleucine, K-Lyslne, L^Leucine, M'^Methionlne, 
N»Asparagine, P>»Proline, Q^GIutamine, R»Arginine, S^^erinCt 
T°Tlireonine, V=Vallne, W==Tryptopiian, Y«Tyrosinc, 
X^Unknown, *=5top codon, /^possible nucleotide deletion, 
Vppossible nucleotide insertion 










NFNIHPLYQHVLLYLQLYDSSRTLYAFSAIKAILK 

TNPIAFVNAISTTSVNNAYTPQLSLLQNLLARHRJ 

SVMGKDFYSHIPVDSNHNFRSSMYIEILISLCLYY 

MRSHYPTOVKVTAQDLIGNRNMQMMSIEILTLL 

FTELAKVTESSAKGFPSFISDMLSKCKVQKVILHC 

LLSSIFSAQKWHSEKMAGKNLVAVEEGFSEDSLI 

NFSEDEFDNGSTLQSQLLKVLQRLIV\LEHRVM\T 

IPEE\NETGFDFWS\DLEHISPHQPMTSLQYLHAQ 

SITCQGMFLCAVIRAXLHQHCACKMHPQWIGLIT 

STLPYMGKVLQRWVSVTLQLCRNLDMLIQQTK 

YETGLSDSRPLWMASIIPPDMILTLLEGITAIIHTC 

LLDPTTQYHQLLVSVDQKHLFEARSGILSILHMI 

MSSVTLLWSILHQADSSEKMTIAASASLTTINLG 

ATKM.RQQILELLGPISMNHGVHFMAAIAFVWN 

ERRQNKTTTRTKVIPAASEEQLLLVELVRSISVM 

RAETVIQTVKEVLKQPPAIAKDKKHLSLEVCML 

QFFYAYIQRIPVPNLVDSWASLLILLKDSIQLSLP 

APGQFLILGVLNEFIMKl^SLENKKDQRDLQDVT 

HKTVnDAIGAIAGSSLEQTTWLRRNLEX^SPKIM 

VDGTNLESDVEDMLSPAMETANITPSVYSVHAL 

TLLSEVLAHLLDMVFYSDEKERVEPLLVNIMHYV 

VPYLRNHSAHNAPSYRACVQLLSSLSGYQYTRR 

AWKKEAFDLFMDPSFFQMDASCVNHWRAIMDN 

LMTHDKTTFRDLMTRVAVAQSSSLNLFANRDVE 

LEQRAMLLKRLAFAIFSSEIDQYQKYLPDIQERLV 

ESLRLPQVPTLHSQVFLFFRVLLLRMSPQHLTSL 

WPTMITELVOVFLLMEOELTADEnTSRTSriPSVA 

GLETTYTGGNGFSTSYNSQRWLNLYLSACKFLD 

LALALPSENLPQFQMYRWAFIPEASDDSGLEVRR 

QGIHQREFKPYVVRLAKLLRKRAKKNPEEDNSG 

RTLGWEPGHLLLTICTVRSMEQLLPFFNVLSQVF 

NSKVTSRCGGHSGSPILYSNAFPNKDMKLENHKP 

CSSKARQKIEEMVEKDFLEGMIKT 


3530 


A 


1 


5684 


VSSVSHENPTEVFEDGENPPSSRSSESGFTEFIQY 

QADRTDDIDRELSEGQGAAAIPIGSTSSETETASr 

VGSEETnQTPSVVTQGTATRSRKTAQKTAMQCC 

LEYVQQFLTRLINLYIIQNNSFSQSLATEHQGDLG 

REQGETSKWDRNSQGDVKEKNISKQKTSICEYLS 

AFLAACQLFLECSSFPVYIAEGNHTSELRSEKLET 

DCEHVQPPQWLQTLMNACSQASDFSVQSVAISL 

VMDLVGLTQSVAMVTGENINSVEPAQPLSPNQG 

RVAWIRPPLTQGNLRYIAEKTEFFBaiVALrLWD 

QLGDGTPQHHQKSVELFYQLHNLVPSSSICEDVI 

SQQLTHKDKKIRMEAHAKJAVLWHLTRDLHINK 

SSSFVRSFDRSLFIMLDSLNSLDGSTSSVGQAWL 

NQVLQRHDIARVLEPLLLLLLHPKTQRVSVQRV 

QAERYWNKSPCYPGEESDKHFMQNFACSNVSQ 

VQLITSKGNGEKPLTMDEIENFSLTVNPLSDRLSL 

LSTSSETIPMWSDFDLPDQQIEILQSSDSGCSQSS 

AGDNLSYEVDPETVNAQEDSQMPBCESSPDDDVQ 

QVVFDLICKVVSGLEVESASVTSQLEffiAMPPKC 

SDIDPDEETIKIEDDSIQQSQNALLSNESSQFLSVS 

AEGGHECVANGISRNSSSPCISGTTHTLHDSSVAS 

lETKSRQRSHSSIQFSFKEKLSEKVSEKETIVKESG 

KQPGAKPKVKLARKKDDDKKKSSNEKLKQTSV 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

locatioii 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
•location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A"=Alanine C=Cysteine, D^Aspartic Acid, 
EXSlutamic Acid, F==Pbenylalanine, G^Glycine, H^Histidine» 
Msoleucine, K'^Lysine, L=Leucioe, M»Metiiionine, 
N=^Asparagine, PNProlioCy Q^lutamioe, RsArginine, S=^erioe, 
T^Threonlne, V«VaIine, W^Tryptophan, y=Tyrosine, 
X«=UnkD0wn, '^'s^Stop codon, /-possible nucleotide delctioa, 
^possible nucleotide Insertion 










FFSDGLDLENWYSCGEGDISEIESDMGSPGSRKSP 

NFNIHPLYQHVLLYLQLYDSSRTLYAFSAIKAILK 

TNPIAFVNAISTTSVNNAYTPQLSLLQNLLARHRI 

SVMGKDFYSHIPVDSNHNFRSSMYIEILISLCL YY 

MRSHYPTHVKVTAQDLIGNRNMQMMSIEILTLL 

FTELAKVIESSAKGFPSFISDMLSKCKVQKVILHC 

LLSSIFSAQKWHSBKMAGKI^VAVEEGFSEDSLI 

NFSEDEFDNGSTLQSQLLKVLQRLIV\LEHRVM\T 

IPEE\NETGFDFWS\DLEHISPHQPMTSLQYLHAQ 

SITCQGMFLCAVIRAVLHQHCACKMHPQWIGLIT 

STLPYMGKVLQRVVVSVTLQLCRNLDNLIQQYK 

YETGLSDSRPLWMASIIPPDMILTLLEGITAIIHYC 

LLDPTTQYHQLLVSVDQKHLFEARSGILSILHMI 

MSSVTLLWSILHQADSSEKMTIAASASLTTINLG 

ATKNLRQQILELLGPISMNHGVHFMAAIAFVWN 

ERRQl^TTTRTKVIPAASEEQLLLVELVRSISVM 

RAETVIQTVnCEVLKQPPAIAKDKKHLSLEVCML 

QFFYAYIQRIPVPNLVDSWASLLILLKDSIQLSLP 

APGQFLLLGVLNEFIMKNPSLENKKDQRDLQDVT 

HKIVDAIGAIAGSSLEQTTWLRRNLEVKPSPKIM 

VDGTNLESDVEDMLSPAMETANITPSVYSVHAL 

TLLSEVLAHLLDMVFYSDEKERVIPLLVNIMHYV 

VPYLRNHSAHNAPSYRACVQLLSSLSGYQYTRR 

AWKKEAFDLFMDPSFFQMDASCVNHWRAIMDN 

LMTHDKTTFRDLMTRVAVAQSSSLNLFANRDVE 

LEQRAMLLKRLAFAIFSSEIDQYQKYLPDIQERLV 

ESLRLPQVPTLHSQVFLFFRVLLLRMSPQHLTSL 

GLETTYTGGNGFSTSYNSQRWLNLYLSACKFLD 

LALALPSENLPQFQMYRWAFIPEASDDSGLEVRR 

QGfflQREFKPYVVRLAKLLRKRAKKNPEEDNSG 

RTLGWEPGHLLLTICTVRSMEQLLPFFNVLSQVF 

NSKVTSRCGGHSGSPILYSNAFPNKDMKLENHKP 

CSSKARQKIEEMVEKDFLEGMIKT 


3531 


A 


553 


2470 


LISPSPALSSQDPALSLKENLEDISGWGLPEARSK 

ESVSFKDVAVDFTQEEWGQLDSPQRALYRDVM 

LENYQNLLALGPPLHKPDVISHLERGEEPWSMQ 

REVPRGPCPEWELKAVPSQQQGICKEEPAQEPIM 

ERPLGGAQAWGRQAGALQRSQAAP\GR\RTCHG 

LGRP\VEEFPLRCPLFAQQRVPEGGPLLDTRK>rV 

QATEGRTKAPARLCAGENASTPSEPEKFPQVRRQ 

RGAGAGEGEFVCGECGKAFRQSSSLTLHRRWHS 

REKAYKCDECGKAFTWSTNLLEHRRJHTGEKPFF 

CGECGKAFSCHSSLNVHQRIHTGERPYKCSACEK 

AFSCSSLLSMHLRVHTGEKPYRCGECGKAFNQR 

THLTRHHRIHTGEKPYQCGSCGBCAFTCHSSLTVH 

EKIHSGDKPFKCSDCEKAFNSRSRLTLHORTHTG 

EKPFKCADCGKGFSCHAYLLVHRRIHSGEKPFKC 
NECGKAFSSHAYLIVHRRIHTGEKPFDCSQCWKA 
FSCHSSLIVHQRIHTGEKPYKCSECGRAFSQNHCL 
EKHQKfflSGEKSFKCEKCGEMFNWSSHLTEHQRL 
HSEGKPLAIQFNKHLLSTYYVPGSLLGAGDAGLR 
DVDPIDALDVAKLLCVVPPRAGRNFSLGSKPRN 


3532 


A 


3931 


317 


HRELQDSPSAEPPAGSMPLRHWGMARGSKPVGD 
GAQPMAAMGGLKVLLHWAGPGGGEPWVTFSES 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alanine C=Cysteine, D^Aspartic Acid* 
E»Glutamic Acid, F^Phenylalanine, G^lycine, H-Histidine, 
I-Isoleucine, K»Lysine, Lr=Leucine, M=Metliionine, 
N=Asparagine, P^Proline, Q^Glutamioe, R-Aiigininc, S=^erine, 
T»Threonine, V=Valine, W<»Tryptopliaii, Y^Tyrosine, 
X*"Unlcnown, ^'^Stop codon, ^possible nucleotide deletion, 
>Fpossibte nucleotide insertion 






■ 




SLTAEEVCIHIAHKVGITPPCFNLFALFDAQAQV 

WLPPNHILEIPRDASLMLYF\RHRFYSR\NWHGM 

NPREPAVYRCGPPGTEASSDQTAQGMQLLDPAS 

FEYLFEQGKHEFVNDVASLWELSTEEEIHHFKNE 

SLGMAFLHLCHLALRHGIPLEEVAKKTSFKDCIP 

RSFRRHIRQHSALTRLRLRNVFRRPLRDFQPGRLS 

QQMVMVKYLATLERLAPRFGTBRVPVCHLRLLA 

QAEGEPCYIRDSGVAPTDPGPESAAGPPTHEVLV 

TGTGOIQWWPVEEEVNKEEGSSGSSGRNPQASL 

FGKKAKAHKAFGQPADRPREPLGAYFCDFRDIT 

HVGLKEHCVSIHRQDNKCLELSLPSRAAALSFVS 

LVDGYFRLTADSSHYLCHEVAPPRLVMSIRDGIH 

GPLLEPFVQAKLRPEDGLYLIHWSTSHPYRLILTV 

AQRSQAPDGMQSLRLRKFPIEQQDGAFVLEGWG 

RSFPSVRELGAALQGCLLRAGDDCFSLRRCCLPQ 

PGETSNLIIMRGARASPRTLNLSQLSFHRVDQKEI 

TQLSHLGQGTRTNVYEGRLRVEGSGDPEEGKMD 

DEDPLVPGRDRGQELRVVLKVLDPSHHDIALAF 

YETASLMSQVSHTHLAFVHGVCVRGPENIMVTE 

YVEHGPLDVWLRRERGHVPMAWKMWAQQLA 

SALSYLENKNLVHGNVCGRNILLARLGLAEGTSP 

FIKLSDPGVGLGALSREERVERIPWLAPECLPGG 

ANSLSTAMDKWGFGATLLEICFDGEAPLQSRSPS 

EKEHFYQRQHRLPEPSCPQLATLTSQCLTYEPTQ 

RPSFRTILRDLTRLQPHNLADVLTVNPDSPASDPT 

VFHKRYLKKIRDLGEGHFGKVSLYCYDPTNDGT 

GEMVAVKALKADCGPQHRSGWKQEIDILRTLYH 

PRHSIGLAQLLLFAQQICEGMAYLHAQHYIHRDL 

AARNVLLDNDRLVKIGDFGLAKAVPEGHEYYRV 

REDGDSPVFWYAPECLKEYKFYYASDVWSFGVT 

LYELLTHCDSSQSPPTKFLELIGIAQGQMTVLRLT 

ELLERGERLPRPDKCPCEVYHLMKNCWETEASF 

RPTFENLIPILKTVHEKYQGQAPSVFSVC 


3533 


A 


182 


3465 


FRWLDFFRGSINSQFEFGRKKENMTSPAKFKKDK 

EIIAEYDTQVKEIRAQLTEQMKCLDQQCELRVQL 

LQDLQDFFRKKAEIEMDYSRNLEKLAERFLAKT 

RSTKDQQFKKDQNVLSPVNCWNLLLNQVKRES 

RDHTTLSDIYLNNIIPRFVQVSEDSGRLFICKSKEV 

GQQLQDDLMKVLNELYSVMKTYHMYNADSISA 

QSKLKEAEKQEEKQIGKSVKQEDRQTPRSPDSTA 

NVRJEEKHVRRSSVKKIEKMKEKRQAKYTENKL 

KAIKARNEYLLALEATNASVFKYYIHDLSDLIDQ 

CCDLGYHASLNRALRTFLSAELNLEQSKHEGLD 

AIENAVBNLDATSDKQRLMEMYNNVFCPPMKFE 

FQPHMGDMASQLCAQQPVQSELLQRCLQLQSRL 

STLKIENEEVKKTMEATLQTIQDIVTVEDFDVSD 

CFQYSNSI^SVKSTVSETFMSKPSIAKRRANQQE 

TEOFYFTKMKEYLEGRNLITKLOAKHDLLOKTL 

GESQRTDCSLARRSSTVRKQDSSQAIPLWESCIR 

nSRHGLQHEGIFRVSGSQVEVNDIKNAFERGEDP 

LAGDQNDHDMDSIAGVLKLYFRGLEHPLFPKDIF 

HDLMACVTMDNLQERALHIRKVLLVLPKTTLII 

MRYLFAFLNHLSQFSEENMMDPYNLAICFGPSL 

MSVPEGHDQVSCQAHVNELIKTniQHENrFPSPRE 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A==Alanine OCysteine, D='Aspartlc Acid, 
E<=Glutamic Acid, F^Phenylalanine, G^GIycine, H^Histidine, 
I-Isoleucine, K-Lyslne, L»Leucine, M»Alethionine, 
N^'Asparagine, P^Proline, Q^Glutamine, R»Arginine, S=Serlne, 
T°Threoninc, V<=Valine, W^Tryptophan, Y«Tyrosine, 
X^Unknown, *=^top codon, /possible nodeotide deletion, 
X^possible nucleotide insertion 










LEGPVYSRGGSMEDYCDSPHGETTSVEDSTQDV 

TAEHHTSDDECEPIEAIAKFDYVGRTARELSFKK 

GASLLLYQRASDDWWEGRHNGIDGLIPHQYIVV 

QDTEDGVVERSSPKSEIEVISEPPEEKVTARAGAS 

CPSGGHVADIYLANINKQRIGlPESGSIRKrFRSDS 

PDKCSISGHGSLNSISRHSSLKNRLDSPQIRKTAT 

AGRSKSFDNHRPMDPEVIAQDffiATMNSALNELR 

ELERQSSVKHTPDWLDTLEPLKTSPVVAPTSEPS 

SPLHTQLLKDPEPAFQRSASTAGDIACAFRPVKS 

VKMAAPVKPPA'RRPKP'RVFPKTNATSPGVNSST 

SPQSTDKSCTV 


3534 


A 


1 


2640 


FRRFVCPASRRPAAGLRDAASSAPRGMASEGPRE 

PESBGIKLSADVKPFVPRFAGLNVAWLESSEACV 

FPSSAATYYPFVQEPPVTEQKIYTEDMAFGASTFF 

PQYLSSEITLHPYAYSPYTLDSTQNVYSVPGSQY 

LYNQPSCYRGFQTNOCHRNENTCPLPQEMKALFK 

KKTYDEKKTYDQQKFDSERADGTISSEIKSARGS 

HHLSIYAENSLKSIXiYHKRTDRKSRIIAKNVSTS 

KPEFEFTTLDFPELQGAENNMSEIQKQPKWGPVH 

SVSTDISLLREWKPAAVLSKGEIWKNNPNESV 

TANAATNSPSCTRELSWTPMGYWRQTLSTELS 

AAPKNVTSMINLKTIASSADPKNVSIPSSEALSSD 

PSYNKEKHUHPTQKSKASQGSDLEQNEASRKNK 

KKKEKSTSKYEVLTVQEPPRIEDAEEFPNLAVAS 

ERRDRIETPKFQSKQQPQDNFKNNVKKSQLPVQL 

DLGGMLTALEKKQHSQHAKQSSKPVVVSVGAV 

PVLSKECASGERGRRMSQMKTPHNPLDSSAPLM 

KKGKQREIPKAKKPTSLKKIILKERQERKQRLQE 

NAVSPAFTSDDTQDGESGGDDQFPEQAELSGPEG 

MDELISTPSVEDKSEEPPGTELQRDTEASHLAPN 

HTTFPKIHSRRFRDYCSQMLSKEVDACVTDLLKE 

LKLKKLKCVIISPNCEKIQSKGGLDDTLHTIIDYA 
CEQNIPFVFALNRKALGRSLNKAVPVSVVGIFSY 
DGAQDQFHKMVELTVAARQAYKTMLENVQQE 
LVGEPXSLRHLPAYPHRAPAALQKMAPQPAVKEK 
EEPHYIEIWKKHLEAYSGCTLELEESLEASTSQM 
MNLNL 


3535 


A 


1747 


983 


LFQFQVCRSVLSPRAAGCTWSLAPRSRGAAGSPR 

RYRGPOPOPAPPSAT P'N^RPSPVA^^GRPMVVT '^V 

PAEVTmLDIEGTTTPIAFVKDELFPYIEENVKEY 

LQTHWEEEECQQDVSLLRKQVXFADVVPAVRKW 

REAGMKVYIYSSGSVEAQKLLFGHSTEGDILELV 

DGHFDTKIGHKVESESYRKIADSIGCSTNNILFLT 

DVTREASAAEEADVHVAVWRPGNAGLTDDEK 

TYYSLITSFSELYLPSST 


3536. 


A 


3 


1302 


GRPPTAPHTGRPPTANRGDPRLDLKRGCARLLTS 

lESRGRPAASAGLRRDRCALRRWPLRRAPLARAT 

RRRAGSPRRCAPRPRACPQGWSRARHQPGGLCL 

LLLLLCQFMEDRSAQAGNCWLRQAKNGRCQVL 

YKTELSKEECCSTGRLSTSWTEEDVNDNTLFKW 

MIFNGGAPNCPCJ^BTCENVDCGPGKKCRMNKK 

NKPRCVCAPDCSNTTWKGPVCGLDGKTYRNECA 

LLKARCKEQPELEVQYQGRCKKTCRDVFCPGSS 
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SEQID 
NO: 


Method 

• 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C~Cysteine, D^Aspartic Acid, 
E^lutamic Acid, F^Phenylalanine, G=Glycine, H==Histidine, 
I»Isoleacine, K^Lysine, L=Leucine, M»Methionine, 
N-Asparagine, P^ProIine, Q=Glutamine, RBArginine, SoSerine, 
T^Threonine, V»Valine, W==Tryptophan, Y=Tyrosine, 
X>»Unl(nown, *=Stop codon, ^possible nucleotide deletion* 
V=pos5ible nucleotide insertion 










TCVWDQTNNAYCVTCNRICPEPASSEQYLCGND 

GVTYS\SACHLRKATCLLGRSIGLAYEGKCIKAK 

SCEDIQCTGGKKCLWDFKVGRGRCSLCDELCPD 

SKSDEPVCASDNATYASECAMKEAACSSGVLLE 

VKHSGSCNSISEDTEEEEEDEDQDYSFPISSILEW 


3537 


A 


285 


2123 


IGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRF 

LTSIPTGIPEDATTLYLQNNQINNAGIPSDLKNLL 

KVERIYLYHNSLDEFPTNLPKYVKELHLQEN>}IR 

TITYDSLSKIPYLEELHLDDNSVSAVSIEEGAFRD 

SNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRIS 

TISSPSLQGLTSLKRLVLDGNLLNNHGLGDKVFF 

NLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQ 

DNHINRVPPNAFSYLRQLYRLDMSNNNLSNLPQ 

GIFDDLDNITQLILRNNPWYCGCKMKWVRDWL 

QSLPVKVNVRGLMCQAPEKVRGMAIKDLNAELF 

DCKDSGIYSTIQITTAIPNTVYPAQGQWPAPVIK 

V^x^JL/JKJNr AJLr 1 JvLlrll^ 1 1 vioJroJKJv mix V Jvo V 1 oU 1 1 

HISWKLALPMTALRLSWLKLGHSPAFGSITETIVT 

GERSEYLVTALEPDSPYKVCMVPMETSNLYLFD 

ETPVCffiTETAPLRMYNPTTTLNREQEKEPYKNP 

NLPLAAIIGGAVALVTIALLALVCWYVHRNGSLF 

SRNCAYSKGRRRIODDYAEAGTKKDNSBLEIRETS 

FQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNH 


3538 


A 

* 


877 


6184 


WNVKPSLLVVQLFKFSDKEEHEQNDSISGKTGET 

GVEEMIATRKVEQDSKETVKLSHEDDfflLEDAGS 

SDISSDAACTNPNKTENSLVGLPSCVDEVTECNL 

ELKDTMGIADKTENTLERNKIEPLGYCEDAESNR 

QLESTEFNKSNLEVVDTSTFGPESNILENAICDVP 

DQNSKQLNAIESTKIESHETANLQDDRNSQSSSV 

SYLESKSVKSKHTKPVIHSKQNMri'DAPKKIVAA 

KYEVIHSKTKVNVKSVKRNTDVPESQQNFHRPV 

KVRKKQIDKEPKIQSCNSGVKSVKNQAHSVLKK 

TLQDQTLVQIFKPLTHSLSDKSHAHPGCLKEPHH 

PAQTGHVSHSSQKQCHKPQQQAPAMKTMSHVK 

EELEHPGVEHFKEEDKLKLKKPEKNLQPRQRRSS 

KSFSLDEPPLFIPDNI ATIRREG SDHSSSFESKYMW 

TPSKQCGFCKKPHGNRFMVGCGRCDDWFHGDC 

VGLSLSQAQQMGEEDKEYVCVKCCAEEDKKTEI 

LDPDTLENQATVEFHSGDKTMECEKLGLSKHTT 

NDRmYIDDTVKHKVKILKRESGEGRNSSDCRD 

NEIKKWQLAPLRKMGQPVLPRRSSEEKSEKIPBCE 

STTVTCTGEKASKPGTHEKQEMKKKKVAEKGVX 

NVHPAASASECPSADQIRQSVRHSLKDILMKRLTD 

SNLKVPEEKAAKVATKIEKELFSFFRDTDAKYKN 

KYRSLMFNLKDPKNNELFKKVLKGEVTPDHLIR 

MSPEELASKELAAWRRRENRHTIEMIEKEQREVE 

RRPITXITHKGEIEEESDAPMKEQEAAMEIQEPAA 

NKSLEKPEGSEK\RKEEVDSMSKDTTSQHRQHLF 

DLNCKICIGRMAPPVDDLSPKKVKVWGVARKH 

SDNEAESIADALSSTSNILASEFFEEEKQESPKSTF 

SPAPRPEMPGTVEVESTFIjya-NFIWKGFINMPS 

VAKFVTKAYPVSGSPEYLTEDLPDSIQVGGRISPQ 

TVWDYVEKIKASGTKEICVVRFTPVTEEDQISYT 

LLFAYFSSRKRYGVAANNMKQVKDMYLIPLGAT 

DKIPHPLVPFDGPGLELHRPNLLLGLURQKLKRQ 
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1 SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteine, D^Aspartic Add, 
E>=Glutamic Acid, F^Phenylalanine, G=Glycioe, H^Histidine, 
I'^Isoleucine, K^Lysinc, Lr=Lcudne, M=Methionine» 
N«A$paragine, P"Proline, Q=Glutamine, R=Argiiiine, S=Serine, 
T"Threonine, V-Vallne, W=Tryptophan, Y-Tyroslne, 
X-Unlcnown, *«Stop codon, /'=possible nucleotide deletion, 
\spossible nucleotide insertion 










HSACASTSHIAETPESAPPIALPPDKXSKIEVSTEE 

APEEEhnDFFNSFTTVLHKQRNKPQQNLQEDLPTA 

VEPLMEVTKQEPPKPLRFLPGVLIGWENQPTTLE 

LANKPLPVDDILQSLLGTTGQVYDQXAQSVMEQ 

NTVKEIPFLKEQTNSKIEKTDNVEVTDGENKEIK 

VKVDNISESTDKSAEIETSWGSSSISAGSLTSLSL 

RGKPPDVSTEAFLTNLSIQSKQEETVESKEKTLKR 

QLQEDQENWLQDNQTSNSSPCRSNVGKGNIDGN 

VSCSENLVANTARSPQFINLKRDPRQAAGRSQPV 

TTSESKDGDSCRNGEKHMLPGLSHNKEHLTEQIN 

VEEKLCSAEKNSCVQQSDNLKVAQNSPSVENIQT 

SQAEQAKPLQEDILMQNIETVHPFRRGSAVATSH 

FEVGNTCPSEFPSKSITFTSRSTSPRTSTNFSPMRP 

QQFNLQHLKSSPPGFPFPGPPNFPPQSMFGFPPHL 

r rr JLdUrJr JrOr U\r AM^ JN Jr M V r W rr V V \rLLr\vjv^rV^K 

MMGPLSQASRYIGPQNFYQVKDIRRPERRHSDP 

WGRQDQQQLDRPFNRGKGDRQRFYSDSHHLKR 

ERHEKEWEQESERHRRRDRSQDKDRDRKSREEG 

HKDKERARLSHGDRGTDGKASRDSRNVDKKPD 

KPKSEDYEKDKEREKSKHREGEKDRDRYHKDR 

DHTDRTKSKR 


3539 


A 


157 


1769 


GSWTVELSLKPSASPSLKWVCLPGAAAVNKHRS 

GAGGLIRSLIQCTWAPAGPARRGGRGIEDFPYLF 

FQLTHCQQRICSVTQAGVQWCDHSSLQPQTPGL 

NQSSHLSLLSSRDYRMLSSFNEWFWQDRFWLPP 

NVTWTELEDRDGRVYPHPQDLLAALPLALVLLA 

MRLAFERFIGLPLSRWLGVRDQTRRQVKPNATL 

EKHFLTEGHRPKEPQLSLLAAQCGLTLQQTQRW 

FRRRRNQDRPQLTJKKFCEAS WRFLFYLSSF VGGL 

SVLYHESWLWAPVMCWDRYPNQLTLSCPAADS 

EAVSLYWWYLLELGFYLSLLIRLPFDVKRKGGGP 

ooiJSjrsSJrri ikJrro 1 AUJrlsiil^ VlJnrlr VA VlLrMlro Y 

SANLLRIGSLVLLLHDSSDYLLEACKMVNYMQY 

QQVCDALFLIFSFVFFYTRLVLFPTQILYTTYYESI 

SNRGPFFGYYFFNGLLMLLQLLHVFWSCLILRML 

YSFMKKGQMEKDIRSDVEESDSSEEAAAAQEPL 

QLKNGTAGGPRPAPTDGPRSRVAGRLTNRHTTA 

T 


3540 


A 


267 


1397 


SPAGYCHSGLLPGCSRSA/CADLAKHQELPGKKL 
LSEKKLKRYFVDYRRVLVCGGNGGAGASCFHSE 
PRKEFGGPDGGDGGNGGHVILRVDQQVKSLSSV 
LSRYQGFSGEDGGSKNCFGRSGAVLYIRVPVGTL 
VKEGGRVVADLSCVGDEYIAALGGAGGKGNRF 

GMVGFPNAGKSSLLRAISNARPAVASYPFTTLKP 

HVGIVHYEGHLQIAVADIPGIIRGAHQNRGLGSA 

FLRMERCRFLLFWDLSQPEPWTQVDDLKYELE 

MYEKGLSARPHAIVANKIDLPEAQANLSQLRDH 

LGOEVIVLSALTGENLEOLLLHLICVLYDAYAEA 

ELGQGRQPLRW 


3541 


A 


1 


800g 


DTQVSETLKRFAGKVTTASVKERJREILSELGKCV 

AGKDLPEGAVKGLCKLFCLTLHRYRDAASRRAL 

QAAIQQLAEAQPEATAKNLLHSLQSSGIGSKAGV 

PSKSSGSAALLALTWTCLLVRIVFPSRAKRQGDI 

WNKLVEVQCLLLLEVLGGSHKilAVDGAVKKLT 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
locatioD 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino add sequence (A»Alanine C-Cysteine, D»Aspartic Acid, 
E^GIutamic Acid, F^Phenylalanine, G==G!ycine, H^Histidine, 
I=IsoIeucine, K»Lysine, Lr^Leucine, MsMethionine, 
N=Asparagine, P^'Proline, Q=Glutamine, R=ArginiDe, S=Serine, 
T=Threonine, V^Valine, W«Tryptophan, Y»Tyrosine, 
XsUnknown, *»Stop codon, ^possible nucleotide deletion, 
V=possible nucleotide insertioo 








1 


KLWKENPGLVEQYLSAILSLEPNQOTAGMLGLL 

VQFCTSHKEMDVVSQHKSALLDFYMKNILMSK 

VKPPKYLLDSCAPLLRYLSHSEFKDLILPTIQKSL 

LRSPENVIETISSLLASVTLDLSQYAMDIVKGLAG 

HLKSNSPRLMDEAVLALRl^ARQCSDSSAMESL 

TKHLFAILGGSEGKLTVVAQKMSVLSGIGSVSHH 

VVSGPSSQVLNGIVAELFIPFLQQEVHEGTLVHA 

VSVLALWCNRFTMEVPKKLTEWFKKAFSLKTST 

SAVRHAYLQCMLASYRGDTLLQALDLLPLLIQT 

VEKAASQSTQVPTITEGVAAALLLLKLSVADSQA 

EAKLSSFWQLIVDEKKQVFTSEKFLViV4ASEDAL 

CTVLH\LTERLFLDHPHRLTG>IKVQQyHRALVA 

VLLSRTWHVRRQAQQTVRKLLSSLGGFKLAHGL 

LEELKTVLSSHKVLPLEALVTDAGEVTEAGKAY 

VPPRVLQEALCVISGVPGLKGDVTDTEQLAQEM 

LnSHHPSLVAVQSGLWPALLARMKTOPEAFITRH 

LDQEPRMTTQSPLNQSSMNAMGSLSVLSPDRVL 

PQLISHTASVQNPALRLVTREEFAIMQTPAGELY 

DKSnQSAQQDSDCKANMKRENKAYSFKEQIIELE 

LKEEEKKKKGIKEEVQLTSKQKEMLQAQLDREA 

QVRRRLQELDGELEAALGLLDIILAKNPSGLTQYI 

PVLVDSFLPLLKSPLAAPRIKNPFLSLAACVMPSR 

LKALGTLVSHVTLRLLKPECVLDKSWCQEELSV 

AVKRAVMLLHTHTITSRVGKGEPGAAPLSAPAFS 

LVFPFLKMVLTEMPHHSEEEEEWMAQILQILTVQ 

AQLRASPNTPPGRVDENGPELLPRVAMLRLLTW 

VIGTGSPRLQVLASDTLTTLCASSSGDDGCAFAE 

QEEVDVLLCALQSPCASVRETVLRGLMELHMVL 

PAPDTDEKNGLNLLRRLWWKFDKEEEIRKLAE 

RLWSMMGLDLQPDLCSLLIDDVIYHEAAVRQAG 

AEALSQAVARYQRQAAEVMGRLMEIYQEKLYR 

PPPVLDALGRVISESPPDQWEARCGLAJLALNKLS 

QYLDSSQVKPLFQFFVPDALNDRHPDVRKCMLD 

AALATLNTHGKENVNSLLPVFEEFLKNAPNDAS 

YDAVRQSVVVLMGSLAKHLDKLSDPKVKPIVAKL 

lAALSTPSQQVQESVASCLPPLVPAIKEDAGGMIQ 

RLMQQLLESDKYAERKGAAYGLAGLVKGLGILS 

LKQQEMMAALTDAJQDKKNFRJRREGALFAFEM 

LCTNdLGKLFEPYVVHVLPHLLLCFGDGNQYVRE 

AADDCAKAVMSNLSAHGVKLVLPSLLAALEEES 

WRTKAGSVELLGAMAYCAPKQLSSCLPNIVPKL 

TEVLTDSHVKVQKAGQQALRQIGSVIRNPEILAI 

APVLLDALTDPSRKTQKCLQTLLDTKFVHFIDAP 

SLALIMPIVQRAFQDRSTDTRKNdAAQlIGNMYSL 

TDQKDLAPYLPSVTPGLKASLLDPVPEVRTVSAK 

ALGAMVKGMGESCFEDLLPWLMETLTYEQSSV 

DRSGAAQGLAEVMAGLGVEKLEKLMPEIVATAS 

KVDIAPHVRDGYIMMFNYLPITFGDKFTPYVGPn 

PCILKALADENEFVRDTALRAGORVISMYAETAl 

ALLLPQLEQGLFDDLWRIRFSSVQLLGDLLFHISG 

VTGKMTTETASEDDNFGTAQSNKAIITALGVERR 

NRVLAGLYMGRSDTQLWRQASLHVWKIVVSN 

TPRTLREELPTLFGLLLGFLASTCADKRTIAARTL 

GDLVRKLGEKILPEIIPILEEGLRSQKSDERQGVCI 

GLSEIMKSTSRDAVLYFSESLVPTARKALCDPLE 
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SEQID 
NO: 


Method 

( 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C^Cystelne, D^-Aspartic Acid, 
£==Glutumic Acid, Fepbenylalanine, G^iycine, H-Hlstldine, 
I=Isoleudne, K=Lysine, L^Leudne, M=Metbionine, 
N»A8paragine» P^Proline» Q=Glotamlne, R=AisiDine, S^Serine, 
T"Threonine» V«Vallne, W^HTryptophan, Y=Tyrosine, 
X=Unl<nown, *=^top codon, /"^possible nucleotide deletion, 
V=possible nucleotide insertion 










EVREAAAKTFEQLHSTIGHQALEDILPFLLKQLD 

DEEVSEFALDGLKQVMAIKSRWLPYLVPiaTTP 

PVNTRVLAFLSSVAGDALTRHLGVILPAVMLAL 

KEKLGTPDEQLEMANCQAVILSVEDDTGHRIIIE 

DLLEATRSPEVGMRQAAAIILNIYCSRSKADYTS 

HLRSLVSGLIRLFNDSSPVVLEESWDALNAITKK 

LDAGNQLALIEELHKEIRLIGNESKGEHVPGFCLP 

KKGVTSILPVLREGVLTGSPEQKEEAAKALGLVI 

RLTSADALRPSVVSITGPLIRILGDRFSWNVKAAL 

LETLSLLLAKVGIALKPFLPQLQTTFTKALQDSNR 

G VRLKAAD ALGKLISIHIKVDPLFl ELLNGIRAME 

DPGVRDTMLQALRFVIQGAGAKVDAVIRKNIVS 

LLLSMLGHDEDNTRISSAGCLGELCAFLTEEELS 
AVT nnpT 1 Anvcomwvn/Dtr/^PGT ai cvai/mv 

/\ V LA^\^\*Liljr\lJ V OvTULi W iVl V JxriOl\.oJL«AljO V A V IN V 

APGRLCAGRYSSDVQEMILSSATADRIPIAVSGV 

RGMGFLMRHHIETGGGQLPAKLSSLFVKCLQNP 

SSDIRLVAEKMIWWANKDPLPPLDPQAIKPILKA 

LLDNTBO^KNTVVRAYSDQAIVNLLKMRQGEEVF 

QSLSKE-DVASLEVLNEVNRRSLKKLASQADSTE 

QVDDTILT 


3542 


A 

■ 


62 

* 


1130 


PWNPQDFPGNRGLMG\QKGEIGPP\GQQGKKGAP 
GMP\GLMGSNGSPGQPGTPGSKGSKGEPGIQGMP 
GASGLKGEPGATGSPGEPGYMGLPGIQGKKGDK 
GNQGEKGIQGQKGENGRQGIPGQQGIQGHHGAK 

KJaVSjr/JSXjilrKJ V KOAJlOoJvLi coO V L/Lf JLMvjr AOIrK. 

GQPGDPGPQGPPGLDGKPGREFSEQFIRQVCTDV 
IRAQLPVLLQSGRIRNCDHCLSQHGSPGIPGPPGPI 
GPEGPRGLPGLPGRDGVPGLVGVPGRPGVRGLK 
GLPGRNGEKGSQGFGYPGEQGPPGPPGPEGPPGI 

PSLCFSVIARRDPFRKGPNY 


3543 


A 


654 


194 


PARSLEKMKLASVVLSLLGYLVVPSGAYILGRCTV 

AKKLHDGGLDYFERYSLENWVCLAYFESKFNPS\ 

AIYENTREGYTGFGLFQMRGSDWCGDHGRNRC 

HMSCSALLNPNLEKTIKCAKTIVKGKEGMGAWP 

TWSRYCQYSDTLARWLDGCKL 


3544 


A 


2 


1074 


SCRLAAGRLAQWLLRASRSGMLRAGWLRGAAA 
LALLLAARVVAAFEPITVGLAIGAASAITGYLSY 
NDIYCRFAECCREERPLNASALKLDLEEKLFGQH 
LATEVT\FKALTGFR]SrNK>JPKKPLTLSLHGWAGT 
GKNFVSOMGAFNLHPKGLKSNFVl-n FVSTT VTPP 

HEQKIKLYQDQLQKWIRGNVSACANSVFIFDEM 

DKL\HPGIIE\AIKPFLDYYEHVERVSYR\KAIFIFLS 
NAGGDLITKTALDFWRAGRKREDIOLKDLEPVT 

SVGVFNNKHSGLWHSGLIDKNLIDYFIPFLPLEYR 
H\^CVRAEMRARGSAIDED1VTRVAEEMTFFP\ 
RDEKIYSDKGCKTVQSRLDFH 


3545 


A 


3 


273 


SAQGRSWGRFYRQIKRHPGIIPMIGLICLGMGSA 

ALYLLR1JVIJR.SPDVW*SWDRKNNPEPWNRLSPN 

DQYKFLAVSTDYKKLKKDRPDF 


3546 


A 


23 


591 


ALSTETRTPDMRRLLLVTSLVVVLLWEAGAVPA 

PKVPIKMQVKHWPSEQDPEKAWGARVVEPPEK 

DDQLWLFPVQKPKLLTTEEKPRGQGRGPILPGT 

KAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQG 

EERPRLWVMPNHQVLLGPEEDQDHIYHPQ*GSR 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C-Cysteine, D^Aspartic Acid, 
E>Glutamic Acid, F^PItenyialanine, G'=Glycine, H^Histidine, 
I='Isoleucine, K^Lysme, L^Leucine, M=Metliionine, 
N»Asparaglne, P^ProIine, Q=Glutamine, R^Argiiune, Serine, 
T^Threonine, V«Valine, W»Tryptoplian, Y=Tyrosine, 
X»l)nknown, *=Stop codon, /-possible nucleotide deletion, 
^possible nucleotide insertion 










GHHCPRPVPRPRLLGLGPSLPCPS 


3547 


A 


23 


591 


ALSTETRTPDMRRLLLVTSLVWLLWEAGAVPA 

PKVPIKMQVKHWPSEQDPEKAWGARVVEPPEK 

DDQLVVLFPVQKPKLLTTEEKPRGQGRGPILPGT 

BCAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQG 

EERPRLWVMPNHQVLLGPEEDQDHIYHPQ*GSR 

GHHCPRPVPRPRLLGLGPSLPCPS 


3548 


A 


3 


1641 


TWLPSVPAEEVQQPEMAAVLNAERLEVSVDGLT 

LSPDPEERPGAEGAPLAAATAATALATWIRSRPG 

RLRGTARSPGRRAAGGAAEEARRLEQRWGFGLE 

ELYGLALRFFKEKDGKAFHPTYEEKLKLVALHK 

QVLMGPYNPDTCPEVGFFDVLGNDRRREWAAL 

GNMSKEDAMVEFVKLLmCCHLFSTYVASHKIE 

K££Q£KKRKEE££RRRR£££ER£RLQK£E£KRRR 

EEEERLRREEEERRRIEEERLRLEQQKQQIMAAL 

NSQTAVQFQQYAAQQYPGKYEQQQILIRQLQEQ 

HYQQYMQQLYQVQLAQQQAALQKQQEWVAG 

sslptsskvecnctovi*cofmroakthtdssfk;e 

lepeaaeealengpkeslpviaapsmwtrpqikd 

fkekiqqdadsvitvgrgevvtvrvptheegsyl 

fwefatdnydigfgvyfewtdsi^ntavsvhvse 

ssdddeeeeenigceekakknankplldeivpvy 

rrdcheevyagshqypgrgvyllkfdnsyslw 

rsksvyyrvyytr 


3549 


A 


1837 


3593 


pavlvlepasqsrkqqntasataqhwsaqihke 

sflapvftkdeqkhrrpyefeverdakargleqf 

sathghtpiilngwhgesamdlscssegspgats 

pfpvsastpkigaisslqgalgmdlsgilqaglihp 

vtgqivngslrrddaatrrrrgrrkhveggmd 

liflkeqtlqagilevhedpgqatlstihpegpgp 

atsapepataassqaeksdpskslldwlrqqady 

slevpgfganfsdkpkqrrprckepgkldvssls 

geervpaipkepglrgflpenkfnhtlaepilrdt 

gprrrgrrprsellkapsivadspsgmgplfmng 

liagmdlvglqnmrnmpgipltglvgfpagfat 

MPTGEEVKSTLSMLPNnS^LPGMAAWOMFGVGCr 

XTUl A VJJ^A^ V A^k^ A JL4kJAVAJ_/A A.TXi.TAA^X VJAVX<X(V V I >^^VA.A VJ V Vj Vj 

llsppmattctstapaslssttksgtavtektae 

dkpsshdvktdtlaedkpgpgpfsdqsepaittss 

pvafnpflipgvspgliypsmflspgmgmalpam 

qqarhseivglesqkrkkkktkgdnpnshpepa 

pscerepsgdencaepsaplpaerehgaqagega 

lkdsnndtn 


3550 


A 


287 


39 


qllsflnioatsqkhrdfvaesvgekpvgslagige 
vmdkkleegcfdkayvvlgqflvlkkdedlf*e 
wlrdtggartrgsre 


3551 


A 


21 


3925 


gdllevglppglefprgiclrglrrtmsldfgsv 

alpvqnedeeydeedyerekelqqlltdlphdm 

ldddlsspelqysdcsedgtdgqphhpeqlems 

wneqmlpksqsvngpscqglepynkvtykpyqs 

saqnngspaqeitgsdtfeglqqqflganensae 

nmqiiqlqvlnkakerqlenlieklneserqiry 

lnhqlviikdekdgltlslresqklfqngkereiq 

leaqikaletqiqalkvneeqmikksrttemale 

slkqqlvdlhhseslqrareqhesivmgltkky 

eeqvlslqknldatvtalkeqedicsrlkdhvk 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, D=Aspartic Acid, 
I>=Glutamic Acid, F=Phenylalanine, G<=Glycme, H^^Histidlne, 
I^Isolendne, K=Lysine, LF'LeuciDe, Ms=Methionine, 
N^Asparagine, P=ProUne, Q»Glutamine, R»Arginine, SnSerine, 
T=ThreonInc, V=Valine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *==Stop codon» /^possible nucleotide deietion, 
\=possible nucleotide insertion 






1 




QLERNQEAIKLEKTEIINKLTRSLEESQKQCAHLL 

QSGSVQEVAQLQFQLQQAQKAHAMSANMNKA 

LQEELTELKDEISLYESAAKLGIHPSDSEGELNIEL 

TESYVDLGIKKVNWKKSKVTSIVQEEDPNEELSK 

DEFILBCLKAEVQRLLGSNSMKRHLVSQLQNDLK 

DCHKKIEDLHQVKKDEKSIEVETKTDTSEKPKNQ 

LWPESSTSDVVRDDILLLKNEIQVLQQQNQELKE 

TEGKLRNTNQDLCNQMRQMVQDFDHDKQEAV 

DRCERTYQQHHEAMKTQIRESLLAKHALEKQQL 

FEAYERTHLQLRSELDKLNKEVTAVQECYLEVC 

REKDKLELTLRKTTEKEQQTQEKIKEKLIQQLEK 

EWQSKLDQTIKAMKKKTLDCGSQTDQVTTSDVI 

SKKEMAIMIEEQKCTIQQNLEQEKDIAIKGAMKK 

LEIELELKHCENITKQVEIAVQNAHQRWLGELPE 

LAEYQALVKAEQKKWEEQHEVSVNKRISFAVSE 

AKEKWKSELENMRKNILPGKELEEKIHSLQKELE 

LKNEEVPVVIRAELAKARSEWNKEKQEEIHRIQE 

QNEQDYRQFLDDHRNKINEYLAAAKEDFMKQK 

TELLLQKETELQTCLDQSRREWTMQEAKRIQLEI 

YQYEEDILTVLGVLLSDTQKEHISDSEDKQLLEI 

MSTCSSKWMSVQYFEKLKGCIQKAFQDTLPLLV 

ENADPEWKKKNMAELSKDSASQGTGQGDPGPA 

AGHHAOPLALOATEAEADKKKVLEIiCDLCCGHC 

FQELEKAKQECQDLKGKLEICCCRHLQHLERKHK 

AVVEKIGEE>nsnCVVEELIEENNDMKNKLEELQT 

LCKTPPRSLSAGAIENACLPCSGGALEELRGQYIK 

AVKKIKCDMLRYIQESKERAAEMVKAEVL*ERQ 

ETARKMRKYYLICLQQILQDDGKEGAEKKIMNA 

ASKLATMAKLLETPISSKSQSKTTQSGMSK 


3552 


A 


771 


375 


ARTRQTSGQAREPEKESPAPGGGGLAEIRSRQQL 
SQTSBUPPLAKDQAVEAMFPPARGKELLSFEDVA 
MYFTREEWGHLNWGQKDLYRDVMLENYKNMV 
LLVYFQFDAAIPLC*TSLAHSSWLQLYFRLYF 


3553 


A 


76 


72 


PGVRGVEAPGGVAPGRNAMRRGERRDAGGPRP 

ESPVPAGRASLEEPPDGPSAGQATGPGEGRRSTE 

SEVYDDGTNTFFWRAHTLTVLFILTCTLGYVTLL 

EETPQDTA YNTKRGI V A SIL VFLCFGVTQ AKDGP 

FSRPHPAYWRFWLCVSVVYELFLIFILFQTYQDG 

RQFLKYVDPKLGVPLPERDYGGNCLIYDPDNET 

DPFHNIWDKLDGFVPAHFLGWYLKTLMIRDWW 

MCMnSVMFEFLEYSLEHQLPNFSECWWDHWIM 

DVLVCNGLGIYCGMKTLEWLSLKTYKWQGLWN 

IPTYKGKMKRIAFQFTPYSWVRFEWKPASSLRR 

WLAVCGIILVFLLAELNTFYLKFVLWMPPEHYLV 

LLRLVFFVNVGGVAMREIYDFMDDPKPHKKLGP 

QAWLVAAITATELLIVVKYDPHTLTLSLPFYISQC 

WTLGSVLALTWTVWRFFLRDITLRYKETRWQK 

WQNKDDQGSTVGNGDQHPLGLDEDLLGPGVAE 

GEGAPTPN*PRGPAPRPLPSAPRAVCGASSRR 


3554 


A 


2 


2106 


FDEFSALPSPSLQTSWSFGPMSRRALRRLRGEQR 

GQEPLGPGALHFDLRDDDDAEEEGPBaiELGVRR 

PGGAGKEGVRVNNRFELINIDDLEDDPWNGERS 

GCALTDAVAPGNKGRGQRGNTESKTDGDDTET 

VPSEQSHASGKLRKKKKKQKNKKSSTGEASENG 

LEDIDRILERIEDSTGLNRPGPAPLSSRKHVLYVE 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, D-Aspartic Acid» 
E^Giutamic Acid, F^Phenylalanine, G»Glycine, H«»Histidine, 
I=Isolcacine, K»Lysine, L^Leucine, M^Methionine, 
N^Asparagine, P-Proline, Q=Glutamine, R^Arginine, S=^eriae, 
T^Threonine, V^Valine, W»Tryptophan, Y=Tyrosine» 
XBUnknown, ^^Stop codon, A^possible nucleotide deletion, 
\r=possible nucleotide insertion 




• 






HRHLNPDTELKRYFGARAILGEQRPRQRQRVYP 

KCTWLTTPKSTWPRYSKPGLSMRLLESKKGLSFF 

AFEHSEEYQQAQHKFLVAVESMEPNNTWLLQT 

SPYHVDSLLQLSDACRFQEDQEMARDLVERALY 

SMECAFHPLFSLTSGACRLDYRRPENRSFYLALY 

KQMSFLEKRGCPRTALEYCKLILSLEPDEDPLCM 

LLLIDHLALRAKNYEYLIRLFQEWEVGASLAHIW 

LSQLPNFAFSVPLAYFLLSQQTDLPECEQSSARQ 

RFFGPNAEISQPPALSQLVNLYLGRSHFLWKEPA 

TMSWLEENVHEVLQAVDAGDPAVEACENRRKV 

LYQRAPRNIHRHVILSEIKEAVAALPPDVTTQSV 

MGFDPLPPSDTIYSYVRPERLSPISHGNTIALFFRS 

LLPNYTMEGERPEEGVAGGLNRNQGL>IRLMLA 

VRDMMANFHLNDLEAPHEDDA * GEGE WD 


3555 


A 


2 


2106 


FDEFSALPSPSLQTSWSFGPMSRRALRRLRGEQR 

GQEPLGPGALHFDLRDDDDAEEEGPKRELGVRR 

PGGAGKEGVRVNNRFELINIDDLEDDPVVNGERS 

GCALTDAVAPGNKGRGQRGNTESKTDGDDTET 

WSEQSHASGKLRKKKKKQKNKKSSTGEASENG 

LEDIDRILERIEDSTGLNRPGPAPLSSRKHVLYVE 

HRHLNPDTELKRYFGARAILGEQRPRQRQRVYP 

KCTWLTTPKSTWPRYSKPGLSMRLLESKKGLSFF 

AFEHSEEYQQAQHKFLVAVESMEPNNIVVLLQT 

SPYHVDSLLQLSDACRFQEDQEMARDLVERALY 

SMECAFHPLFSLTSGACRLDYRRPENRSFYLALY 

KQMSFLEKRGCPRTALEYCKLILSLEPDEDPLCM 

LLLIDHLALRARNYEYLIRLFQEWEVGASLAHRN 

LSQLPNFAFSVPLAYFLLSQQTDLPECEQSSARQ 

RFFGPNAEISQPPALSQLVNLYLGRSHFLWKEPA 

TMSWLEENVHEVLQAVDAGDPAVEACENRRKV 

LYQRAPRNIHRHVILSEIKEAVAALPPDVTTQSV 

MGFDPLPPSDTIYSYVRPERLSPISHGNTIALFFRS 

LLPNYTMEGERPEEGVAGGLNRNQGL>iRLMLA 

VRDMMANFHLNDLEAPHEDDA*GEGEWD 


3556 


A 


3388 


1650 

• 


KTRGTMFYYPNVLQRHTGCFATIWLAATRGSRL 

VKREYLRVNVVKTCEEILNYVLVRVQPPQPGLP 

RPRFSLYLSAQLQIGVIRVYSQQCQYLVEDIQHIL 

ERLHRAQLQIRIDMETELPSLLLPNHLAMMETLE 

DAPDPFFGMMSVDPRLPSPFDIPQIRHLLEAAIPE 

RVEEIPPEVPTEPREPERIPVTVLPPEAITILEAEPIR 

MLEIEGERELPEVSRRELDLLIAEEEEAILLEIPRL 

PPPAPAE*GQELLDQVGCQCWEGSPHFSCPFPLR 

VEGMGEALGPEELRLTGWEPGALLMEVTPPEEL 

RLPAPPSPERRPPVPPPPRRRRRRRLLFWDKETQI 

SPEKFOEOLOTRAHCWECPMVOPPERTIRGPAET 

FRTPTLSGWLPPELLGLWTHCAQPPPKALRRELP 
EEAAAEEERRKIEVPSEIEVPREALEPSVPLMVSL 
EISLEAAEEEKSRISLIPPEERWAWPEVEAPEAPA 
LPVVPELPEVPMEMPLVLPPELELLSLEAVHRAV 
ALELQANREPDFSSLVSPLSPRRMAARVFYLLLV 
LSAQQILHVKQEKPYGRLLIQPGPRFH 


3557 


A 


3388 


1650 


KTRGTMFYYPNVLQRHTGCFATIWLAATRGSRL 
VKREYLRVNWKTCEEILNYVLVRVQPPQPGLP 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted eod 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AJanine C=Cy5teine, D«A5partic Acid, 
'E=Glutamic Acid, F=»Phenylalanine, G^GIydne, H'^HIstidine, 
I»Iso!eucine, K"=Lysine, LF*Leucine, M°MetMonlne, 
N^Asparagine, P=Proline, (^Glutamine, R»Arginine, Serine, 
T»Threonlne, V=Vaiinc, W=Tryptophan, Y«=Tyrosine, 
XsUnknoTvn, *=^top codoo, ^^ssible nucleotide deletion, 
V»pos$ible nucleotide insertion 










RPRFSLYLSAQLQIGVIRVYSQQCQYLVEDIQHIL 

ERLHRAQLQDUDMETELPSLLLPNHLAMMETLE 

DAPDPFFGMMSVDPRLPSPFDIPQIRHLLEAAIPE 

RVEEIPPEVPTEPREPERIPVTVLPPEAITILEAEPIR 

MLEIEGERELPEVSRRELDLLIAEBEEAILLEIPRL 

PPPAPAE*GQELLDQVGCQCWEGSPHFSCPFPLR 

VEGMGEALGPEELRLTGWEPGALLMEVTPPEEL 

RLPAPPSPERRPPVPPPPRRRRRRRLLFWDKETQI 

SPEKFOEOT OTRAHCWRCPMVOPPFRTTRGPAFT 

FRTPTLSGWLPPELLGLWTHCAQPPPKALRRELP 

EEAAAEEERRKIEVPSEIEVPREALEPSVPLMVSL 

EISLEAAEEEKSRISLIPPEERWAWPEVEAPEAPA 

LPVVPELPEVPMEMPLVLPPELELLSLEAVHRAV 

ALELQANREPDFSSLVSPLSPRRMAARVFYLLLV 

LSAQQILHVKQEKPYGRLLIQPGPRFH 


3558 


A 


489 


2360 

* 


IRPRPRGRRRALDSPNAAAPPVYVCRSPGEPTSL 

VNMASEDIAKLAETLAKTQVAGGQLSFKGKSLK 

LNTAEDAKDVIKEIEDFDSLEALRLEGNTVGVEA 

ARVIAKAL*KKSELKRCHWSDMFTGRLRTEIPPA 

LISLGEGLITAGAQLVELDLSDNAFGPDGVQGFE 

ALLKSSACFTLQELKLNNCGMGIGGGKILAAALT 

ECHRKSSAQGKPLALKVFVAQRNRLENDGATAL 

AEAFRVIGTLEEVHMPQNGINHPGITALAQAFAV 

NPLLRVINLNDNTFTEKGAVAMAETLKTLRQVE 

VINFGDCLVRSKGAVAIADAIRGGLPKLKELNLS 

FCEIKRDAALAVAEAMADKAELEKLDLNGNTLG 

EEGCEQLQEVLEGFNMAKVLASLSDDEDEEEEE 

Fff FFFFFF A FFFFFFnFFFFFFFFFFFFFFPOnR C\ 

QGEKSATPSRKILDPNTGEPAPVLSSPPPADVSTF 
LAFPSPEKLLRLGPKSSVLUQQTDTSDPEKWSA 
FLKVSSVFKDEATVRIVIAVQDAVDALMQKAFNS 
SSFNSNTFLTRLLVHMGLLKSEDKVKAIANLYGP 
LMALNHMVQQDYFPKALAPLLLAFVTKPNSALE 
SCSFARHSLLQTLYKV 


3559 


A 


489 


2360 


IRPRPRGRRRALDSPNAAAPPVYVCRSPGEPTSL 

VNMASEDIAKLAETLAKTQVAGGQLSFKGKSLK 

LNTAEDAKDVIKEIEDFDSLEALRLEGNTVGVEA 

ARVIAKAL*KKSELKRCHWSDMFTGRLRTEIPPA 

LISLGEGLITAGAQLVELDLSDNAFGPDGVQGFE 

ALLKSSACFTLQELKLNNCGMGIGGGKILAAALT 

ECHRKSSAQGKPLALKVFVAGRNRLENDGATAL 

AEAFRVIGTLEEVHMPQNGINHPGITALAQAFAV 

NPLLRXONLNDNTFTEKGAVAMAETLKTLRQVE 

VINFGDCLVRSKGAVAIADAIRGGLPKLKELNLS 

FCEQCJRDAALAVAEAMADKAELEKLDLNGNTLG 

EEGCEQLQEVLEGFNMAKVLASLSDDEDEEEEE 

EGEEEEEEAEEEEEEDEEEFEFFFFFFFFFPOORn 

QGEKSATPSRKILDPNTGEPAPVLSSPPPADVSTF 
LAFPSPEKLLRLGPKSSVLIAQQTDTSDPEKVVSA 
FLKVSSVFKDEATVRMAVQDAVDALMQKAFNS 
SSFNSNTFLTRLLVHMGLLKSEDKVKAIANLYGP 
LMAL^lHMVQQDYFPKALAPLLLAFVTKPNSALE 
SCSFARHSLLQTLYKV 


3560 


A 


2 


1198 


FVRELPRPRPGAATAAIMVSVINTVDTSHEDMIH 
DAQMDYYGTRLATCSSDRSVKIFDYRNGGQILIA 
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S£Q ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A='Alaninc C=Cysteine, D^Aspartic Acid, 
E<<^lutamic Add, F»Plienylalanine, GKSIycine, H»Hjstidinc, 
IsISGleucine, K^Lysine, U>Leucine, M^sMetliionine, 
N^Asparagine, P^Proline, Q=GlQtaniine, R'^Arginine, SHSerinc, 
T=Threonine, V=VaIine, W=Tryptoplian, Y==Tyroslne, * 
X<=Unknown, *=Stop codon, A=posslble nucleotide deletion, 
V^possible nucleotide insertion 










DLRGHEGPVWQVAWAHPMYGNILASCSYDRKV 

IIWREENGTWEKSHEHAGHDSSVNSVCWAPHDY 

GLILACGSSDGAISLLTYTGEGQWEVKKINNAHT 

IGCNAVSWAPAVVPGSLIDHPSGQKPNYIKRFAS 

GGCDNLIKLWKEEEDGQWKEEQKLEAHSDWVR 

DVAWAPSIGLPTSTIASCSQDGRVFIWTCDDASS 

NTWSPKLLHKFNDVVWHVSWSITANILAVSGGD 

NK\nrLWKESVDGQWVCISDVNKGQGSVSASVT 

EGQQNEQ*QDRWGLAPHPPAPGLPLPGPTNQTT 

GKSPQLQQDYFPRRSYRCSHRLIICLNVIGDAL 


3561 


A. 


540 


86 


WRVKEMTSTLPKALGRKTASRSHTTLQGGSCCP 

VLWTAKLRCRKJLRFPLPPPPPSSSAWPWQGWGI 

RGEQEAEGPLGETGPPVGPELSGLRQWRKLIKGR 

YGEWRGSGQKTGQPS*TTMQGGETEENRTETTT 

GNKQRESEAPWVRHTYIT 


3562 


A 


1920 


242 


PMMAMPFFERFKSSIQRPSPVLVLSQNTKRESGR 

KVQSGNINAAKTL'a)IIRTCLGPKSMMKMLLDP 

MGGIVMTNDGNAILREIQVQHPAAKSMIEISRTQ 

DEEVGDGTTSVIILAGEMLSVAEHFLEQQMHPTV 

VISAYRKALDDMISTLKKISIPVDISDSDMMLNIIN 

SSITTKAISRWSSLACNIALDAVKMVQFEENGRK 

EIDIKKYARVEKIPGGnEDSCVLRGVMINKDVTH 

PRMRRYIKNPRIVLLDSSLEYKKGESQTDIEITRE 

EDFTRILQMEEEYIQQLCEDUQLKPDWITEKGIS 

DLAQHYLMRANITAIRRVRKTDNNRIARACGARI 

VSRPEELREDDVGTGAGLLEIKKIGDEYFTFITDC 

KDPKACriLLRGASKEILSEVERNFQDAMQVCRN 

VLLDPQLVPGGGASEMAVAHALTEKSKAMTGV 

EQWPYRAVAQALEVIPRTLIQNCGASTIRLLTSLR 

AKHTQENCETWGVNGETGTLVDMKELGIWEPL 

AVKLQTYKTAVETAVLLLRIDDIVSGHKKKGDD 

QSRQGGAPDAGQE 


3563 


A. 


1571 


560 


GPSLLGmGTPNPARTLQIFFLIIGRRLTGRMAAV 

DDLQFEEFGNAATSLTANPDATTVNIEDPGETPK 

HQPGSPRGSGREEDDELLGNDDSDKTELLAGQK 

KSSPFWTFEYYQTFFDVDTYQVFDRIKGSLLPPG 

KNFVRLYIRSNPDLYGPFWICATLVFAIAISGNLS 

NFLIHLGEKTYHYVPEFRKVSIAATIIYAYAWLVP 

LALWGFLMWRNSKVMNIVSYSFLEIVCVYGYSL 

FIYIPTAILWIIPHKAVRWILVMIALGISGSLLAMT 

FWPAVREDNRRVALATIVTIVLLHMLLSVGCLA 

YFFDAPEMDHLPTTTATPNQTVAAAKSS 


3564 


A. 


1 


328 


NSRVDDFVAHLQRPLLGPASCLGILRPAMTAHSF 
ALPGIIFTTFWGLVGIAGPWFVPKGPNRGVIITML 
VATAVCCYLFWLIAILAQLNPLFGPQLKNETIWY 

VRFLWE 


3565 




2 


1081 


fvtdfparsmaatslmsalaarllqpahscslrl 

rpfhlaavrneavvisgrklaqqikqevrqevee 

wvasgwkrphlsvelvgenpashsyvlnktraa 

avvginsetinikpasiseeellnlinkl>wddnvd 

gllvqlplpehiderricnavspdkdvdgfhvim 

vgrmcldqysmlpatpwgvweiikrtgiptlgk 

nvvvagrsknvgmpiamllhtdgaherpggda 

rvtishrytpkeqlkkhtiladrvisaagepnlita 

dmucegaavddvgimrvhdpvtakpklvgdvdf 
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SEQID 
NO: 


Method 


Predicted 
beginning 
nucleotide 
location 

corresponding 
to first amino 
acid residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=»Alanine C'^Cysteine, D^^Aspartic Acid, 
£=Glutaniic Acid, F^Phenylalanine, G=Glycioe, H-Histidioe, 
I»Isoleadne, K=Lysine, L=Leucine, M-Metliionine, 
N=<Asparaslne* P=ProUne, Q^Iutamine, R— Areinine.S>^erine. 
T»Tbreonine, VeVallne, W«Tryptophan, V^Tyrosine, 
X=Unl(nown, *=Stop codon, /^possible nucleotide deledon, 
V=possible nucleotide insertion 










EGVRQKAGYITPVPGGVGPMTVAMLMKNTIIAA 
KKVLRLEEREVLKSKELGVATN 


3566 


A 


3 


1130 


SCRRGRQQQRRIsfVSLSSQFAHTMAAPAQQTTQP 

GGGKRKGKAQYVLAKRARRCDAGGPRQLEPGL 

QGILITCNMNERKCVEEAYSLLNEYGDDMYGPE 

KFTDKDQQPSGSEGEDDDAEAALKKEVGDIKAS 

TEMRLRRFQSVESGAKNVVFIRTLGIEPEKLVHHI 

LQDMYKTKKKKTRVILRMLPISGTCKAFLEDMK 

KYAETFLEPWFKAPNKGTFQIVYKSRNNSHVNR 

EEVIRELAGIVCTLNSENKVDLTNPQYTWVEIIK 

AVCCLSVVKDYMLFRKYNLQEVVKSPKDPSQLN 

SKQGNGKEAKLESADKSDQNNTAEGKNNQQVP 

ENTEELGQTKPTSNPQVVNEGGAKPELASQATE 

GSKSNENDFS 


3567 


A 


248 


3498 

< 

-A 


GKKDSSPWTCPFHPPLQLFFVIRNTRQLGDFHLA 

mVRNYWTADGDLDIGAKNVKLYVNRl^IFNG 

KLDKGDREAPADHSILVDQKNEKSEQLEEAMNA 

HSEESKGTHEMAGASGDKELGLGCSPPAETLAD 

AKLSSQGNVSGKRKNSTNCRKDSLSQLEEYLRLS 

AVPTSMGDMPSAPATSPPVKCPPVHEEPSLIQQL 

ENLMGRKICEPPGKTPSWLQPSFTGKDRKQGGR 

KPKPLWLSPEKPLAWKGRLPSDDVIGEGPGETEA 

RDKGLRHEPGWGTSRSVNTKERPQRATTKVHSD 

DSDIFNQPPNRERPASGRRGSRKDAGSSSHGDDQ 

PASREDTWSSRTPSRSRWRSEQEHTLHESWSSLS 

AFDRSHRGRISNTELPGDILDELLQQKSSRHSDLP 

PSKKGEQPGLSRGQDGYSGETDAGGDFKIPVLPY 

GQRLVIDIKSTWGDRHYVGLNGIEIFSSKGEPVQI 

SNIKADPPDINILPAYGKDPRWTNLIDGVNRTQ 

DDMHVWLAPFTRGRSHSITIDFTHPCHVALIRJW 

NYNKSRfflSFRGVKDITMLLDTQCIFEGEIAKASG 

TLAGAPEHFGDTELFTTDDDILEAIFYSDEMFDLD 

VGSLDSLQDEEAMRRPSTADGEGDERPFTQAGL 

GADERIPELELPSSSPVPQVTTPEPGIYHGICLQLN 

FTASWGDLHYLGLTGLEVVGKEGQALPIHLHQIS 

ASPRDLNELPEYSDDSRTLDKLIDGTNITMEDEH 

MWLIPFSPGLDHVVTIRLDRAESIAGLRFWNYNK 

SPEDTYRGAKIVHVSLDGLCVSPPEGFLIRKGPG 

NCHFDFAQEILFVDYLRAQLLPQPARRLDMRSLE 

CASMDYEAPLMPCGFIFQFQLLTSWGDPYYIGLT 

GLELYDERGEKIPLSENNIAAFPDSVNSLEGVGG 

DVRTPDKLIDQYNDTSDGRHMWLAPILPGLVNR 

VYVIFDLPTTVSMIKLWNYAKTPHRGVKEFGLL 

VDDLLVYNGILAMVSHLVGGILPTCEPTVPYHTI 

LFTEDRDIRHQEKHTTISNQAEDQDVQMMNENQ 

UTNAKRKQSVVDPALRPKTCISEKETRRRRC 


3568 


A 


50 


1724 


AQGGTLSAASRFCRGGLLGPWLHPASEMAATLD 

LKSKEEKDAELDKRIEALRRKNEALIRRYQEIEE 

DRKKAELEGVAVTAPBUCGRSVEKENVAVESEKK 

LGPSRRSPGTPRPPGASKGGRTPPQQGGRAGMG 

RASRSWEGSPGEQPRGGGAGGRGRRGRGRGSPH 

LSGAGDTSISDRKSKEWEERRRQNIEKMNEEME 

KIAEYERNQREGVLEPNPVRNFLDDPRRRSGPLE 

ESERDRREESRRHGRNWGGPDFERVRCGLEHER 

QGRRAGLGSAGDMTLSMTGRERSEYLRWKQER 



385 



wo 01/57190 



PCT/USOl/04098 



S£QID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanlne C=Cystcine, D=A8partic Acid, 
EBGlutamic Acid, F°Phenylalanine, G-'Glycine, H=Histidine, 
falsoleudne, K»Lysine, L^Leucine, M»Methionine, 
N=A5paragine» PsProUne, OHjlutamiDC, R^Arginioe, S»Serine, 
T^ThreoDine, V==VaIlnc, W=Tryptophan, Y^iyrosine, 
X«Unknown, *"Stop codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion 










EKmQERLQRHRKPTGQWRREWDAEKTDGMFK 

DGPVPAHEPSHRYDDOAWARPPKPPTFGEFLSO 

HKAEASSRRRRKSSRPQAKAAPRAYSDHDDRWE 

TKEGAASPAPETPQPTSPETSPKETPMQPPEIPAP 

AHRPPEDEGEENEGEEDEEWEDISEDEEEEEIEVE 

EGDEEEPAQDHQAPEAAPTGIPCSEQAHGVPFSP 

EEPLLEPQAPGTPSSPFSPPSGHQPVSDWGEEVEL 

NSPRTTHLAGALSPGEAWPFESV 


3569 


A 


1 


912 


MGRVGRAGVQLGRRRTTWAAERTGQAAAGGP 
GRALRGQRPDLRSGGAADSPAAGRGELYCGVLP 

RSPWFLSERRRQMADFDTYDDRAYSSFGGGRGS 
RG SAGGHG SRSOKELPTEPP YT A YVGNl PFNTV 

QGDIDAIFKDLSIRSVRLVRDKDTDKFKGFCYVE 

FDEVDSLKEALTYDGALLGDRSLRVDIAEGRKQ 

DKGGFGFRKGGPDDRGFRDDFLGGRGGSRPGDR 

RTGPPMGSRFRDGPPLRGSNMDFREPTEEERAQR 

PRLQLKPRTVATPLNQVANPNSAIFGGARPREEV 

VQKEQE 


3570 


A 


1 


912 


MGRVGRAGVQLGRRRTTWAAERTGQAAAGGP 

GRALRGQRPDLRSGGAADSPAAGRGELYCGVLP 

RSPWFLSERRRQMADFDTYDDRAYSSFGGGRGS 

QGDIDAIFKDLSIRSVRLVRDKDTDKFICGFCYVE 
FDEVDSLKEALTYDGALLGDR^T RVTjTAFrtRK'O 

DKGGFGFRKGGPDDRGFRDDFLGGRGGSRPGDR 
RTGPPMGSRFRDGPPLRGSNMDFREPTEEERAQR 
PRLQLKPRTVATPLNQVANPNSAIFGGARPRJEEV 
VQKEQE 


3571 


A 


28 


131 


RHFFGNLCAMRAKWRKKRMRRLKRKRRKMRQ 
RSK 


3572 


A 


3 


1202 


QSEPHRKVRVDPPVRDRPPPHPPPLLVQRALPGQ 

GQAEGSDGADGAKRRAMAHQTGIHATEELKEFF 

AKARAGSVRLIKWIEDEQLVLGASQEPVGRWD 

QDYDRAVLPLLDAQQPCYLLYRLDSQNAQGFE 

WLFLAWSPDNSPVRLKMLYAATRATVKKEFGG 

GHIKDELFGTVKDDLSFAGYOKHT SSPAAPAPI T 

SAERELQQIRINEVKTEISVESKHQTLQGLAFPLQ 

PEAQRALQQLKQKMVNYIQMKLDLERETIELVH 

TEPTDVAQLPSRVPRDAARYHFFLYKHTHEGDP 

LESVVFIYSMPGYKCSIKERMLYSSCKSRLLDSV 

EQDFHLEIAKKIEIGDGAELTAEFLYDEVHPKQH 

AFKQAFAKPKGPGGKRGHKRLIRGPGENGDDS 


3573 


A 


49 


1869 


PHCEPNPGAGAMVLLHVLFEHAVGYALLALKEV 

EEISLLQPQVEESVLNLGKFHSIVRLVAFCPFASS 

QVALENANAVSEGWHEDLRLLLETJILPSKKKK 

VLLGVGDPKIGAAIQEELGYNCQTGGVIAEILRG 

VRLHFHNLVKGLTDLSACKAQLGLGHSYSRAKV 

KFNV^VDNMIIQSISLLDQLDKDINTFSMRVRE 

WYGYHFPELVKIINDNATYCRLAOFIGNRRELNE 

DKLEKLEELTMDGAKAKAILDASRSSMGMDISAI 

DLINIESFSSRWSLSEYRQSLHTYLRSKMSQVAP 

SLSALIGEAVGARLIAHAGSLTNLAKYPASTVQIL 

GAEKALFRALKTRGNTPKYGLIFHSTFIGRAAAK 

NKGRISRYLAISIKCSIASRIDCFSEVPTSVFGEKLR 

EQVEERLSFYETGEIPRKNLDVMKEAMVQAEAE 
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SEQID 
NO: 


Metiiod 


Predicted 

beginning 

nucleotide 

locatioD 

corresponding 

to first amino 

acid residue of 

pepnae 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C^Cysteine, D^Aspartic Acid, 
E<<:tutamic Acid, F^Phenylalanine, G=Glycine, H=^Histidinc, 
lalsoleuclne, K-Lysine, L=Leacine, M=Methionine, 
N-Asparaginc, PaProline, Q^GIutamine, RBArginine, S=Serine, 
T==Threonine, V=Vallne, W^Tryptophant Y-Tyroslne, 
X^Unknown, *=Stop codon, /-^possible nadeotldc deletion, 
\=possible nucleotide insertion 










EAAAEITIOaEKQEKKRLKKEKKRLAALALASS 

ENSSSTPEECEETSEKPKKKKKQKPQEVPQENGM 

EDPSISFSKPKKKKSFSKEELMSSDLEETAGSTSIP 

KRKKSTPKEETV^TOPEEAGHRSRSKKKRKFSKEE 

PVSSGPEEAVGKSSSKKKKKFHKASQED 


3574 


A 


284 


2032 


CGNERTARLWVQPVVSTMPQASEHRLGRTREPP 

VNIQPRVGSKLPFAPRARSKERRNPASGPNPMLR 

PLPPRPGLPDERLKKLELGRGRTSGPRPRGPLRA 

DHGVPLPGSPPPTVALPLPSRTNLARSKSVSSGDL 

RPMGIALGGHRGTGELGAALSRLALRPEPPTLRR 

STSLRRLGGFPGPPTLFSIRTEPPASHGSFHMISAR 

SSEPFYSDDKMAHHTLLLGSGHVGLRlSrLGNTCF 

LNAVLQCLSSTRPLRDFCLRRDFRQEVPGGGRA 

QELTEAFADVIGALWHPDSCEAVNPTRFRAVFQ 

KYVPSFSGYSQQDAQEFLKLLMERLHLEINRRGR 

RAPPILANGPVPSPPRRGGALLEEPELSDDDRANL 

MWKRYLEREDSKTVDLFVGOLKSCLKCOACGY 

RSri FEVFCDLSLPIPKKGFAGGKVSLRDCFNLFT 

KEEELESENAPVCDRCRQKTRSTKKLTVQRFPRi 

LVLHLNRFSASRGSKKSSVGVDFPLQRLSLGDF 

ASDKAGSPVYQLYALCNHSGSVHYGHYTALCR 

CQTGWHVYNDSRVSPVSENQVASSEGYVLFYQL 

MQEPPRCL 


3575 


A 


1 


2408 


RELDSLADLPERIKPPYANGLSTSHLRSSSVEDVK 

LUSEGRPTIEV!«ICSMPSVICEHTKQFQTISEESN 

QGSLLTVPGDTSPSPKPEVFSNVPERDLSNVSNIH 

SSFATSPTGASNSKYVSADRNLIKNTAPVNTVMD 

SPVHLEPSSQVGVIQNKSWEMPVDRLETLSTRDF 

ICPNSNIPDQESSLQSFCNSENKVLKENADFLSLR 

QTELPGNSCAQDPASFMPPQQPCSFPSQSLSDAES 

ISKHMSLSYVANQEPGILQQKNAVQUSSALDTD 

NESTKDTENTFVLGDVQKTDAFVPVYSDSTIQEA 

SPNFEKAYTLPVLPSEBCDFNGSDASTQLNTHYAF 

SKLTYKSSSGHEVENSTTDTQVISHEKENKLESL 

VLTHLSRCDSDLCEMNAGMPKGNLNEQDPKHC 

PESEKCLLSIEDEESQQSILSSLENHSQQSTQPEM 

HKYGQLVKVELEENAEDDKTENQIPQRMTRNK 

ANTMANQSKQILASCTLLSEKDSESSSPRGRIRLT 

EDDDPQIHHPRKRKVSRVPQPVQVSPSLLQAKEK 

TQQSLAAIVDSLKLDEIQPYSSERANPYFEYLHIR 

KKIEEKRKLLCSVIPOAPOYYDEYVTFNGSYLLD 

GNPLSKICIPTITPPPSLSDPLKELFRQQEVVRMKL 

RLQHSIEREKLIVSNEQEVLRVHYRAARTLANQT 

LPFSACTVLLDAEVYNVPLDSQSDDSKTSVRDRF 

NARQFMSWLQDVDDKFDKLKTCLLMRQQHEA 

AALNAVQRLEWQLiaQELDPATYKSISIYEIQEF 

YVPLVDVNDDFELTPI 


3576 


A 


5 


1421 


LRLAWHDGARWPLGTPRAAATRREAAALPPVT 

LALLCLDGVFLSSAENDFVHRIQEELDRFLLQKQ 

LSKVLLFPPLSSRLRYLIHRTAENFDLLSSFSVGE 

GWKRRTVICHQDIRVPSSDGLSGPCRAPASCPSR 

YHGPRPISNQGAAAVPRGARAGRWYRGRKPDQ 

PLYVPRVLRRQEEWGLTSTSVLKREAPAGRDPEE 

PGDVGAGDPNSDQGLPVLMTQGTEDLKGPGQR 

CENEPLLDPVGPEPLGPESQSGKGDMVEMATRF 
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S£QD> 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alanine C=Cysteinc, D=Aspartic Acid, 
E^Glutaroic Acid, F=Phenylalanine, G^lycine, H^IfistidiDey 
I»Isoleucine, K^Lyslne, L^Leuctne, M^Metfalonine, 
N=Asparagine, P^ProIine, QBGlutamine, R»Arginine, S»Serine, 
T^Threoninc, V-Vallne, W"Tryptophan, YaTyrosine, 
Xs^Unknown, "^^Stop codon, /==possible nucleotide deletion, 
\=poss{Me nucleotide insertion 










GPSSCSEDDYSELLQEITDNLTKKEIQIEKIHLDTS 

SFMEELPGEKDLAHVVEIYDFEPALKTEDLLATF 

SEFQEKGFRIQWVDDTHALGIFPCRASAAEALTR 

EFSVLKIRPLTQGTKQSKLKALQRPKLLRLVKER 

PQTNATVARRLVARALGLQHKKKERPAVRQPLP 

P 


3577 


A 


102 

* 


1998 


DTRTPGSLEMGPLQFRDVAIEFSLEEWHCLDTAQ 

RNLYRNVMLENYSNLVFLGIVVSKPDLIAKLEQG 

KI<a>LTMKRHEMVANPSGPVICSHFAQDLWPEQN 

IKDSFQKVILRRYEKRGHGNLQLIKRCESVDECK 

VHTGGYNGLNQCSTTTQSKVFQCDKYGKVFHK 

FSNSNRHNIRHTEKKPFKCIECGKJ^QFSTLITH 

KKmTGEKPYICEECGKAFKYSSALNTHKRIHTG 

EKPYKCDKCDKAFIASSTLSKHEIIHTGKKPYKCE 

ECGKAFNQSSTLTKHKKIHTGEKPYKCEECGKAF 

NQSSTLTKHKKIHTGEKPYVCEECGKAFKYSRIL 

TraKRIHTGEKPYKO^CGKAFL^SSTLSR^ 

MGKKHYKCEECGKAFIWSSVLTRHKRVHTGEKP 

z ]vl^ilJ^\^(jrJSJ\T Js. X oa 1 LooiiJSJvoil iiJCllU' YJvLlSbC 

GKAFVASSTLSKHEIIHTGKKPYKCEECGKAFNQ 

SSSLTKHKKIHTGEKPYKCEECGKAFNQSSSLTK 

HKmXGEKPYKCEECGKAFNQSSTLIKHKKIHT 

REKPYKCEECGKAFHLSTHLTTHKILHTGEKPYR 

CRECGKAFNHSATLSSHKKIHSGEKPYECDKCG 

BCAFISPSSLSRHEIIHTGEKP 


3578 


A 


1725 


445 


RPRRRGTHHFSCVLGSFRVSAMFPRVSTFLPLRP 

LSRHPLSSGSPETSAAAIMLLTVRHGTVRYRSSA 

LLARTKNNIQRYFGTNSVICSKKDKQSVRTEETS 

KETSESQDSEKENTKKDLLGIIKGMKVELST\nNV 

RTTKPPKRRPLKSLEATLGRLRRATEYAPKKRIEP 

LSPELVAAASAVADSLPFDKQTTKSELLSQLQQH 

P'P'CQP A HP n A K'P Pl^' TQF'QTvITTQ'n\yfl^ \/ADQATAP\/ 
DCIlotsAv^lNJ^AJSJvxJNJor oXNUoJJiVuv V AKoA.1 ArvV 

RSRPELRIQFDEGYDNYPGQEKTDDLKKRKMFT 

GKRLNIFDMMAVTKEAPETDTSPSLWDVEFAKQ 

LATVNEQPLQNGFEELIQWTKEGKLWEFPINNEA 

GFDDDGSEFHEHIFLEKHLESFPKQGPIRHFMELV 

TCGLSKNPYLSVKQKVEHmWFRNYFNEBCKDELK 

ESNIQFKLRPWKFLFRNN 


3579 


A 


1725 


445 


RPRRRGTHHFSCVLGSFRVSAMFPRVSTFLPLRP 

LSRHPLSSGSPETSAAAIMLLTVRHGTVRYRSSA 

LLARTKNNIQRYFGTNSVICSKKDKQSVRTEETS 

KETSESQDSEKENTKKl)LLGnKGMKVELSTVNV 

RTTKPPKRRPLKSLEATLGRLRRATEYAPKKRIEP 

LSPELVAAASAVADSLPFDKQTTKSELLSQLQQH 

RSRPELRIQFDEGYDNYPGQEKTDDLKKRKNIFT 
GKRLNIFDMMAVTKEAPETDTSPSLWDVEFAKQ 
LATVNEOPLONGFEELIOWTKEGKLWEFPINNEA 

gfdddgsefhehiflekhlesfpkqgpirhfmelv 
tcglsknpylsvklqkvehiewfrnyft^ekkdilk: 
esniqfklrpwkflfrnn 


3580 


A 


3673 


1619 


lycvapysrhllgrmshlpmkllrkkieicrnlk 
lrqrnlkfqgasnltlsetqngdvseetmgsrx 
vkkskqkpmnvglsetqnggmsqeavgmikvt 
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SEQlD 
NO: 


Method 


Predicted 

begionJog 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A°Aianine C'=Cysteine, D=>Aspartic Acid, 
£>=Glutaniic Acid, F-Phenylalanine, G^iycine, H»Histidine, 
I»Isoleucine, K^Lysinc, I^LeucinC) M=Methionine, 
N^Asparagine, P^ProIine, Q=<llutaniiDe, R^Arginine, S=Serinc, 
T^Threonine, V=VaIine, W=Tryptophan, Y=Tyrosine, 
X=:XJnknown, *^top codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion 




■ 






KSPQKSTVLTNGEAAMQSSNSESKKKKKKXR^ 

MVNDAEPDTKKAKTENKGKSEEESAETTKETE>I 

NVEKPDNDEDESEVPSLPLGLTGAFEDTSFASLC 

NLVNENTLKAIKEMGmmTEIQHKSIRPLLEGR 

DLLAAAKTGSGKTLAFLIPAVELIVKLRFMPRNG 

TG\a.ILSPTKELAMQTFG\^KELMTHHVHTYGLI 

MGGSNRSAEAQKLGNGINIIVATPGRLLDHMQN 

TPGFMYKNLQCLVIDEADRILDVGFEEELKQIIKL 

LPTRRQTMLFSATQTRKVEDLARISLKKEPLYVG 

VDDDKANATVDGLEQGYVVCPSEKRPLLLFTFL 

KKbTRKKKLMVFFSSCMSVKYHYELLNYIDLPVL 

AIHGKOKONKRTTTFFOFCN ADSGTLLCTDV A A 

RGLDIPEVDWIVQYDPPDDPKEYIHRVGRTARGL 

NGRGHALULRPEELGFLRYLKQSKVPLSEFDFS 

WSKISDIQSQLEKLIEKNYFLHKSAQEAYKSYIRA 

YDSHSLKQIFNVNNLNLPQVALSFGFKVPPFVDL 

NVNSNEGKOKKRGGGGGFGYOKTKKVEKSKIF 

KHISKKSSDSRQFSH 


3581 


A 


23 


453 


LCRCICIKNITPHCLWDKVLSQFTYILDM-SNFMS 

HHPHSLRNSCLIRMDLLVWQFTIYTITFCFSHLSG 

RLTLSAQfflSHRPCLLSYSLLFWKVHHLFLEGFPC 

SPRLDEMSFHQFPQHPVHVSWHLPIVYKGSMT 

QVSPH 


3582 


A 


3 


950 


TRGCGNKMAGKKNVLSSLAVYAEDSEPESDGEA 
GIEAVGSAAEEKGGLVSDAYGEDDFSRLGGDED 
GYEEEEDENSRQSEDDDSETEKPEADDPKDNTE 
AEKRDPOELV A SFSER VRNMSPDF ]K IPPFPPGR r 

SNHLQDKIQKLYERKIKEGMDMNYnQRKKEFKN 

PSIYEKLIQFCAIDELGTNYPBCDMFDPHGWSEDS 

YYEALAKAQKIEMDKLEKAKKERTKIEFVTGTK 

KGTTTNATSTTTTTASTAVADAQKRKSKWDSAI 

PVTTIAQPTILlll ATLPAVVTVTTSASGSKTTVIS 

AVGTIVKKAKQ 


3583 


A 


3 


950 


TRGCGNKMAGKKNVLSSLAVYAEDSEPESDGEA 

GIEAVGSAAEEKGGLVSDAYGEDDFSRLGGDED 

GYEEEEDENSRQSEDDDSETEKPEADDPKDNTE 

AEKRDPOELVASFSERVRNMSPDEIKIPPEPPGRC 

SNHLQDKJQKLYERKJKEGMDMNYIIQRKKEFKN 

PSIYEKLIQFCAIDELGTNYPKDMFDPHGWSEDS 

YYEALAKAQKmMDKLEKAKKERTKlEFVTGTK 

KIGTTTNATSTTTTTASTAVADAQKRKSKWDSAI 

PVTmQPmri lATLPAVVTVITSASGSKTTVIS 

AVGTIVKKAKQ 


3584 


A 


3 


1139 


PGSTISSRADRLGAPVLAHPKMAERQEEQRGSPP 

LRAEGKADAEVKLILYHWTHSFSSQKVRLVIAE 

KALKCEEHDVSLPLSEHNEPWFMRLNSTGEVPV 

LIHGENIICEATQIIDYLEQTFLDERTPRLMPDKES 

MYYPRVQHYRELLDSLPMDAYTHGCDLHPELTV 

DSMBPAYATTRIRSQIGNTESELKKLAEENPDLQE 

AYIAKQKRLKSKLLDHDNVKYLKKILDELEKVL 

DQVETELPRRNEETPEEGQQPWLCGESFTLADVS 

LAVTLHRLKFLGFARRKWGNGKRPNLETYYERV 

LKRKTFNKVLGHVNNILISAVLPTAFRVAKKRAP 

KVLGTTLVVGLLAGVGYFAFMLFRKRLGSMILA 

LRPRPNYF 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, D^Aspartic Acid, 
E'^Glutamic Acid, F^Pfaenylalanine, G<=Glycine, H^Histidine, 
I^lsoleucine, K°Lysine, I>Leudtte, M^Metliionine, 
N»Asparagine, P^ProlinCy Q^GIutamine, R»Arginine, &=Serine, 
T-Tbreonine, V^Valinc, W^Tryptopban, Y=Tyrosinc, 
XaUnknovt^n, *»Stop codon, /-possible nncleotide deletion, 
V^possible nucleotide insertion 


3585 


A 


1 


1777 

4 


RRHSPGSPAFAPSSRATAICPRAARAPATLLLALG 

AVLWPAAGAWELTILHTNDVHSRLEQTSEDSSK 

CVNASRCMGGVARLFTKVQQIRRAEPNVLLLDA 

GIX3YQGTIWFTVYKGAEVAHFMNALRYDAMA 

LGNHEFDNGA/^GLIEPLLKEAKFPILSANIKAKGP 

LASQISGLYLPYKVLPVGDEVVGIVGYTSKETPF 

LSNPGTNLVFEDEITALQPEVDKLKTLNVNKIIAL 

GHSGFEMDKLIAQKVRGVDVVVGGHSNTFLYT 

GNPPSKEVPAGKYPFIVTSDDGRKVPWQAYAF 

GKYLGYLKIEFDERGNVISSHGNPILLNSSPEDPS 

IKADINKWRIKLDNYSTQELGKTIVYLDGSSQSC 

RFREO^IMGNLICDAMINN>n^RHTDEMFWNHVS 

MCILNGGGIRSPIDERNNGTITWENLAAVLPFGG 

TFDLVQLKGSTLKKAFEHSVHRYGQSTGEFLQV 

GGIH\rvnmLSRKPGDRVVKLDVLCTKCRVPSYD 

PLKMDEVYKVILPNFLANGGDGFQMIKDELLRH 

DSGDQDINWSTYISKMKVryPAVEGRIKFSTGS 

HCHGSFSLIFLSLWAVIFVLYO 


3586 


A 


1399 


881 


LSNKDVLSPQLKDENSKLRRKLNEVQSFSEAQTE 

MVRTLERKLEAKMIKEESDYHDLESWQQVEQN 

LELMTKRAVKAENHWKLKQEISLLQAQVSNFQ 

RENEALRCGQGASLTVVKQNADVALQNLRWM 

NSAQASIEQLVSGAETLNLVAEILKSIDRISEVKD 

EEEDS 


3587 


A 


88 


1639 


GCVGRGLPLPPRHPTPPSSSSSPFVLLAFLLLVRL 

DPAVSGBCMAAPRPPPARLSGVMVPAPIQDLEAL 

RALTALFKEQRNRETAPRTJFQRVLDILKKSSHA 

VELACRDPSQVENLASSLQLITECFRCLRNACIEC 

SVNQNSIRNLDTIGVAVDLILLFRELRVEQESLLT 

AFRCGLQFLGNIASRNEDSQSIVWVHAFPELFLS 

CLimPDKKIVAYSSMILFTSLNHERMKELEENLN 

lAIDVIDAYQKHPESEWPFLIITDLFLKSPELVQA 

MFPKLNNQERVTLLDLMIAKITSDEPLTKDDIPVF 

LRHAELIASTFVDOCKTVLKLASEEPPDDEEALA 

TIRLLDVLCEMTVNTELLGYLQVFPGLLERVIDL 

LRVIHVAGKETTNIFSNCGCVRAEGDISNVANGF 

KSHLIRLIGNLCYKNKDNQDKVNELDGIPLILDN 

CNISDSNPFLTQWVIYAIRNLTEDNSQNQDLIAK 

MEEQGLADASLLKKVGFEVEKKGEKLILKSTRD 

TPKP 


3588 


A 


3 


1462 


DSPRNRFEILGRPTRTPTRPGPRPAMEDLDALLSD 

LETTTSHMPRSGAPKERPAEPLTPPPSYGHQPQT 

GSGESSGASGDKDHLYSTVCKPRSPKPAAPAAPP 

FSSSSGVLGTGLCELDRLLQELNATQFNITDEIMS 

QFPSSKVASGEQKEDQSEDKKRPSLPSSPSPGLPK 

ASATSATLELDRLMASLSDFRVQNHLPASGPTQP 

PWSSTNEGSPSPPEPTGKGSLDTMLGLLQSDLSR 

RGVPTQAKGLCGSCNKPIAGQVVTALGRAWHPE 

HFVCGGCSTALGGSSFFEKDGAPFCPECYFERFSP 

RCGFCNQPIRHKMVTALGTHWHPEHFCCVSCGE 

PFGDEGFHEREGRPYCRRDFLQLFAPRCQGCQGP 

ILDNYISALSALWHPDCFVCRECFAPFSGGSFFEH 

EGRPLCENHFHARE.GSLCATCGLPVTGRCVSAL 

GRRFHPDHFTCTFCLRPLTKGSFQERAGKPYCQP 

CFLKLFG 1 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nacleotide 

locadon 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A==Alanine C=Cysteine, D^Aspartic Acid» 
E°Glutaroic Acid, f^Plienylalanlne, 0=GIycinef H»Hlstidine, 
I'^Isoleucine, K^Lysinc, Lr=Leudne, M^Methionine, 
N-Asparaginc, P^Prdine, Q=Glutamine, R°Arginine, S^Serine, 
T=Tbreonine, V^Valine, W=Tryptophan, Y=Tyrosine^ 
X°Unknown, ^'^top codon, A^possible nucleotide deletion, 
^possible nucleotide insertion 

* 


3589 


A 


226 


6793 

* 


SPPKKSRKCNLSFRLISAERWRFFLLILMEMPRKP 

RLTLFVQRRIENIATEREFDPEEFYYLLEAAEGHA 

KEGQGIKTDPRYnSQLGLNKDPLEEMAHLGNY 

DSGTAETPETDESVSSSNASLKLRRKPRESDFETI 

KLISNGAYGAVYFVRHKESRQRFAMKKINKQNL 

ILRNQIQQAFVERDILTFAENPFVVSMYCSFETRR 

HLCMVMEYVEGGDCATLMKNMGPLPVDMARM 

YFAETVLALEYLHWGIVHRDLKPDNLLVTSMG 

HIKLTDFGLSKVGLMSMTTNLYEGHIEKDAREFL 

DKQVCGTPEYIAPEVILRQGYGKPVDWWAMGII 

LYEFLVGCVPFFGDTPEELFGQVISDEINWPEKDE 

APPPDAQDLITLLLRQNPLERLGTGGAYEVKQHR 

FFRSLDWNSLLRQKAEFIPQLESEDDTSYFDTRSE 

KYHHMETEEEDDTNDEDFNVEIRQFSSCSHRFSK 

^a^SSrDRITQNSAEEKEDSVDKTKSTTLPSTETLS 

WSSEYSEMQQLSTSNSSDTESNRHKLSSGLLPKL 

AISTEGEQDEAASCPGDPHEEPGKPALPPEECAQ 

EEPEV'l"! PASTISSSTLSVGSFSEHLDQINGRSECV 

DSTDNSSKPSSEPASHMARQRLESTEKKKISGKV 

TKSLSASALSLMIPGDMFAVSPLGSPMSPHSLSSD 

PSSSRDSSPSRDSSAASASPHQPIVIHSSGKNYGFT 

IRAIRVYVGDSDIYTVHHWWNVEEGSPACQAGL 

KAGDLITHINGEPVHGLVHTEVIELLLKSGNKVSI 

TTTPFEm^IKTGPARRNSYKSRMVRRSKKSKKK 

ESLERRRSLFKKLAKQPSPLLHTSRSFSCLNRSLS 

SGESLPGSPTHSLSPRSPTPSYRSTPDFPSGTNSSQ 

SSSPSSSAPNSPAGSGHIRPSTLHGLAPKLGGQRY 

RSGRRKSAGNIPLSPLARTPSPTPQPTSPQRSPSPL 

LGHSLGNSKIAQAFPSKMHSPPnVRHIVRPKSAE 

PPRSPLLKRVQSEEKLSPSYGSDKKHLCSRKHSL 

EVTQEEVQREQSQREAPLQSLDENYCDVPPLSRA 

RPVEQGCLKRPVSRKVGRQESVDDLDRDKLKAK 

VVVKKADGFPEKQESHQKFHGPGSDLENFALFK 

LEEREKKVYPKAVERSSTFENKASMQEAPPLGSL 

LKDALHKQASVRASEGAMSDGPVPAEHRQGGG 

DFRRAPAPGTLQDGLCHSLDRGISGKGEGTEKSS 

QAKELLRCEKLDSKLANIDYLRKKMSLEDKEDN 

LCPVLKPKMTAGSHECLPGNPVRPTGGQQEPPPA 

SESRAFVSSTHAAQMSAVSFVPLKALTGRVDSGT 

EKPGLVAPESPVRKSPSEYKLEGRSVSCLEPIEGT 

LDIALLSGPQASKTELPSPESAQSPSPSGDVRASV 

PPVLPSSSGKKNDTTSARELSPSSLKMNKSYLLEP 

WFLPPSRGLQNSPAVSLPDPEFKRDRKGPHPTAR 

SPGTVMESNPQQREGSSPKHQDHTTDPKLLTCLG 

QNLHSPDLARPRCPLPPEASPSREKPGLRESSERG 

PPTARSERSAARADTCREPSMELCFPETAKTSDN 

SKNLLSVGRTHPDFYTQTQAMEKAWAPGGKTN 

HKDGPGEARPPPRDNSSLHSAGIPCEKELGKVRR 

GVEPKPEALLARRSLQPPGIESEKSEKLSSFPSLQ 

KDGAKEPERKEQPLQRHPSSIPPPPLTAKDLSSPA 

ARQHCSSPSHASGREPGAKPSTAEPSSSPQDPPKP 

VAAHSESSSHKPRPGPDPGPPKTKHPDRSLSSQK 

PSVGATKGKEPATQSLGGSSREGKGHSKSGPDVF 

PATPGSQNKASDGIGQGEGGPSVPLHTDRAPLDA 

KPQPTSGGRPLEVLEKPVHLPRPGHPGPSEPADQ 
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SEQID 
NO: 


Method 


Predicted 

beginning 

auclcotide 

location 

corresponding 

to flrst amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alantne C^Cysteine, D^Aspartic Add, 
E^Glutamic Acid, F^Phenylalanine, &=Glycine, H^HIstidine, 
I=Isolcucine, K^Lysine, Lr=Leacine, M^Methlonine, 
N-Asparagine, P=Proline, Q=Glutaniine, R^Arginine, SsSerine, 
T«Threonine, V=Valinc, W«Tryptopiian, Y^Tyrosioe, 
X«Unl(nown, *^top codon, /^possible nucleotide deletion, 
\»possible nucleotide insertion 










KLSAVGEKQTLSPKHPKPSTVKDCPTLCKQTDN 
RQTDKSPSQPAANTDRRAEGKKCTEALYAPAEG 
DKLEAGLSFVHSENRLKGAERPAAGVGKGFPEA 
RGKGPGPOKPPTFADKPNGMICRSPSATGnS^FRS 

TALPEKSLSCSSSFPETRAGVREASAASSDTSSAK 

AAGGMLELPAPSNRDHRKAQPAGEGRTHMTKS 

DSLPSFRVSTLPLESHHPDPNTMGGASHRDRALS 

VTATVGETKGKDPAPAQPPPARKQNVGRDVTKP 

SPAPNTDRPISLSNEKDFVVRQRRGKESLRSSPHK 

KAL 


3590 


A 


3 


935 


RATTRPKNEVQDYVSVEYLSPHMGGTDPFKYSY 
PPLVDDDFQTPLCENGPITSEDETSSKEDIESDGK 
ETLETISNEEQTPLLKKINPTESTSKAEENEKVDS 

KTLrVLTNVTKNIVAFKVRTTAPEKYRVKPSNSS 
CDPGASVDIVVSPHGGLTVSAQDRFUMAAEME 

SSKPNTLTLKDNAFNMSDKTSEDIGLQLSRLLES 

NRKLEDQVQRCIWFQQLLLSLTMLLLAFVTSFFY 

LLYS 


3591 


A 


303 


2 


GGSWGPLCPVSPAMSLSDPGLGYHPTCWTLRWP 

PLCSLHALHVFHCLFSSRLGTPVSPRLAMDPNCS 

CEAGGSCACAGSCKCKKCKCTSCKKSCCSCCPL 


3592 


A 


1052 


1779 


GKTMMRKMLLAAALSVTAMTAHADYQCSVTP 

rvUL» VI V or^V^ 1 Vl^V JS,vjJCilNOJNirfVl UrUOiN VJVl I rSOiv 

QYSLNAAQREQAKDYQAELRSTLPWIDEGAKSR 

VEKARIALDKJIVQEMGESSKMRSRLTKLDAQVK 

EQMNRIIETRSDGLTFHYKAIDQVRAEGQQLVNQ 

AMGGILQDSINEMGAKAVLKSGGNPLQNVLGSL 

GGLQSSIQTEWKKQEKDFQQFGKDVCSRWTLE 

DSRKALVGNLK 


3593 


A 

* 


3 


1837 


LSFEKVDIQTDNDLTKEMYEGKENVSFELQRDFS 

QETDFSEASLLEKQQEVHSAGN1KKEKSNTE)GT 

VKDETSPVEECFFSQSSNSYQCHTITGEQPSGCTG 

LGKSISFDTKLVKHEIINSEERPFKCEELVEPFRCD 

SQLIQHQENNTEEKPYQCSECGKAFSINEKLIWH 

QRLHSGEKPFKCVECGKSFSYSSHYITHQTIHSGE 

KPYQCKMCGKAFSVNGSLSRHQRIHTGEBCPYQC 

KECGNGFSCSSAYITHQRVHTGEKPYECNDCGK 

AFNGNAKLIQHQRIHTGEKPYECNECGKGFRCSS 

QLRQHQSIHTGEKPYQCKECGKGFNNNTKLIQH 

QRJMTASLAEQLFKASGNHPNWGCCLTISSPGPS 

VYGPKMNMRGAPNSRLAGGREKRTQDTDFGQC 

SFi PSHSP^rFFPWMVTnYnsswYROKnvT snv 

WSSPLSILKLPRTLIRISIHIQEMDTPGEMLMTGR 

GSLGPTL'rrKAPAAAQPGKQGPPGTGRCLQAPGT 

EPGEOTPEGARELSPLOESSSPGGVKAEEEORAG 

AEPGTRPSLARSDDNDHEVGALGLQQGKSPGAG 
NPEPEQDCAARAPVRAEAVRRMPPGAEAGSVVL 
DD 


3594 


A 


39 


261 


RAAMMDTSRVQPDCLAIVIKVLGRTGSQGQCTQ 

VRVEFMDDTSRSIIRSVKGPVREGDVLTLLESERE 

ARRLR 


3595 


A 


973 


68 


GRVGTKHQMADDAGAAGGPGGPGGPGMGNRG 
GFRGGFGSGIRGRGRGRGRGRGRGRGARGGKAE 
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SEQlD 
NO: 


Method 


Predicted 

beginning 

nucleotide 

locatloD 

correspondiDg 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'=Alanine C=Cysteine, D=»Aspartic Acid, 
EXSIutamic Acid, F-Phenylalanine, G=Glycine, H=Histidine, 
l^Isoleucine, K^Lysine, L=Leucine, M^Methionine, 
N==Asparagine, P^Proline, (^Glutamine, R'^Arginine, S=Serine, 
T-Tbreonine, V=Valine, W=Tryptophan, Y^Tyrosinc, 
X»Unl{no>vn, *=Stop codon, A>possible nncleotide deletion, 
V^ossible nucleotide insertion 










DKEWMPVTXLGRLVKDMKIKSLEEIYLFSLPIKE 
SEIIDFFLGASLKDEVLKIMPVOKOTRAGORTRF 

KAFVAIGDYNGHVGLGVKCSKEVATAIRGAIBLA 

KLSIVPVRRGYWGNKIGICPHTVPCKVTGRCGSV 

LVRLIPAPRGTGIVSAPVPKKLLMMAGIDDCYTS 

ARGCTATLGNFAKATFDAISKTYSYLTPDLWKE 

TVFTKSPYQEFTDHLVKTHTOVSVQRTQAPAVA 

TT 


3596 


A 


106 


2960 


DERRVGAADMFGRSRSWVGGGHGKTSRNIHSL 

DHLKYLYHVLTKNTTVTEQNRNLLVETIRSITEIL 

IWGDQNDSSVFDFFLEKNMFVFFLNILRQKSGRY 

VCVQLLQTLNILFENISHETSLYYLLSNNYVNSn 

VHKFDFSDEEIMAYYISFLKTLSLKLN>MTVHFF 

YNEHTbTOFALYTEAIKFFhmPESMVRIAVRTITL 

NVYKVSLDNQAMLHYIRDKTAVPYFSNLVWnC 

SHVIELDDCVQTDEEHRNRGKLSDLVAEHLDHL 

HYLNDILIINCEFLNDVLTDHLLNRLFLPLYVYSL 

ENQDKGGERPKISLPVSLYLLSQVFLIIHHAPLVN 

SLAEVBLNGDLSEMYAKTEQDIQRSSAKPSIRCFI 

KPTETLERSLEMNKHKGKRRVQKRPNYKNVGEE 

EDEEKGPTEDAQEDAEKAKGTEGGSKGIKTSGES 

EEIEMVIMERSKLSELAASTSVQEQNTTDEEKSA 

AATCSESTQWSRPFLDMVYHALDSPDDDYHALF 

VLCLLYAMSHNKGMDPEKLERIQLPVPNAAEKT 

TYNHPLAERLIRIMNNAAQPDGKIRLATLELSCL 

LLKQQVLMSAGCIMKDVHLACLEGAREESVHLV 

RHFYKGEDffLDMFEDEYRSMTMKPMNVEYLM 

MDASILLPPTGTPLTGIDFVKRLPCGDVEKTRRAI 

RVFFMLRSLSLQLRGEPETQLPLTREEDLDCTDDV 

T r>T TsTM^riT TAPTVTTK'nnnMVnP^T AVmvnAyfQ 
LiUl^lH IN ^ULtirxK^ I V 1 i rvJU»^wlVl V I^IvOIjA V Lfl X V^lVlo 

LVEPDVSRLGWGVVKFAGLLQDMQVTGVEDDS 
R ALNTTTHKP A S SPH<^ K PFPTT O A TFTF^nTTTR PIT A IC 

QRLAKGRIQARRMKMQRIAALLDLPIQPTTEVLG 
FGLGSSTSTQHLPFRFYDQGRRGSSDPTVQRSVF 
ASVDKVPGFAVAQCINEHSSPSLSSQSPPSASGSP 
SGSGSTSHCDSGGTSSSSTPSTAQSPAGIGHVTQ 


3597 


A 


427 


277 


GVRRIQHHWAQMHECNVHTYASLFCLFLLHTG 
KLCCLNSHRHFHCIKYSK 


3598 


A 


1 


503 


FRPRTKKATAMYLEHYLDSIENLPCELQRNFQL 

MRELDQRTEDKKAEIDILAAEYISTVKTLSPDQR 

VERLQKIQNAYSKCKEYSDDKVQLAMQTYEMV 

DKHIRRLDADLARFEADLKDKMEGSDFESSGGR 

GLKKGRGQKEKRGSRGRGRRTSEEDTPKKKKH 

KGG 


3599 


A 


2 


3907 


KTITALAFSPDGKYLVTGESGHMPAVRVWDVAE 

HSQVAELQEHKYGVACVAFSPSAKYIVSVGYQH 

DMIVNVWAWKKNIVVASNKVSSRVTAVSFSED 

CSVFVTAGNRHIKFWYLDDSKTSKVNATVPLLG 

RSGLLGELRNNLFTDVACGRGKKADSTFCITSSG 

LLCEFSDRRLLDKWVELRVYPEVKDSNQACLPP 

SSFITCSSDNTIRLWNTESSGVHGSTLHRNILSSDL 

IKUYVDGNTQALLDTELPGGDKADASLLDPRVGI 

RSVCVSPNGQHLASGDRMGTLRVHELQSLSEML 

KVEAHDSEILCLEYSKPDTGLKLLASASRDRLIH 

VLDAGREYSLQQTLDEHSSSITAVKFAASDGQVR 
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SEQID 
NO: 


IVfethod 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alanine C=Cysteine, D-Aspartic Add* 
E^Glutamic Acid, F^Phenylalanine, G=G}ycinei H»Histidine, 
I^'Isoleucine, K»Lysine, L^Leucine, M'^Metbionine, 
N^Asparagine, paProline, Q^GIutamlne, R'^Arginine, S>>$erine, 
T=Threonine, V=Valine, W=Tryptophan, Y^Tyrosioe, 
X=Unknown, *^top codon, /^possible nucleotide deletion, 
\Fpossible nucleotide insertion 




• 






MISCGADKSIYFRTAQKSGDGVQFTRTHHVVRK 

TTLYDMDVEPSWKYTAIGCQDRNIRIFNISSGKQ 

KKLFKGSQGEDGTLIKVQTDPSGIYIATSCSDKML 

SIFDFSSGECVATMFGHSEIVTGMKFSNDCKHLIS 

VSGDSCIFVWRLSSEMTISMRQKLAELRQRQRGG 

KQQGPSSPQRASGPNRHQAPSMLSPGPALSSDSD 

KEGEDEGTEEELPALPVLAKSTKKALASVPSPAL 

PRSLSHWEMSRAQESVGFLDPAPAANPGPRRRG 

RWVQPGVELSVRSMLDLRQLETLAPSLQDPSQD 

SLAIEPSGPRKHGQEALETSLTSQNEKPPRPQASQ 

PCSYPHURLLSQEEGVFAQDLEPAPIEDGIVYPEF 

SDNPTMDTSEFQVQAPARGTLGRVYPGSRSSEK 

HSPDSACSVDYSSSCLSSPEHPTEDSESTEPLSVD 

GISSDLEEPAEGDEEEEEEEGGMGPYGLQEGSPQ 

TPDQEQFLKQHFETLASGAAPGAPVQVPERSESR 

SISSRFLLQVQTRPLREPSPSSSSLALMSRPAQVPQ 

ASGEQPRGNGANPPGAPPEVEPSSGNPSPQQAAS 

VLLPRCRLNPDSSWAPKRVATASPFSGLQKAQS 

VHSLVPQERHEASLQAPSPGALLSREIEAQDGLG 

SLPPADGRPSRPHSYQNPTTSSMAKISRSISVGEN 

LGLVAEPQAHAPIRVSPLSKLALPSRAHLVLDIPK 
PLPDRPTLAAFSPVTKGRAPGFAEKPnFPVfiT OIC 

AHSTTERWACLGEGTTPKPRTECQAHPGPSSPCA 

QQLPVSSLFQGPENLQPPPPEKTPNPMECTKPGA 

ALSQDSEPAVSLEQCEQLVAELRGSVRQAVRLY 

HSVAGCKMPSAEQSRIAQLLRDTFSSVRQELEAV 

AGAVLSSPGSSPGAVGAEQTQALLEQYSELLLRA 

VERRMERKL 


3600 


A 


1688 


916 


IPGSTISCSMALCEAAGCGSALLWPRLLLFGDSIT 

OFSFOOGG WG ASLADRL VRKCDVLNRGF SG YN 

TRWAKHLPRLIRKGNSLDIPVAVTIFFGANDSAL 

KDENPKOHIPLEEYAANLKSMVOYLKSVDIPENR 

VILITPTPLCETAWEEQCIIQGCKLNRLNSVVGEY 

ANACLQVAQDCGTDVLDLWTLMQDSQDFSSYL 

SDGLHLSPKGNEFLFSHLWPLIEKKVSSLPLLLPY 

WRDVAEAKPELSLLGDGDH 


3601 


A 


44 


223 


VHFPLIPQLAKCFWTMNRAARNKSEKRYYSEFL 
QIAHLFNYGLSSFLREFIIFLIKLLQ 


3602 


A 


37 


1124 


VPKPASGKRRLEFRPQDSKACAATPHSPGRJTSR 

TRGSQKVRSVPPRLPWAQASASTDWEGLRGVPG 

PALRRENFLEAAASGRSGRTPTGGVGFRDVGGP 

HFPIFPAAHFLWCNLHTPRRPACNAPWHSPVGEI 

SPPPRESOLRRDPEVHFESPAHPLGFRLLPGRGLP 

ANAVTVETAAMAAPRQIPSHIVRLKPSCSTDSSF 

TRTPVPTVSLASRELPVSSWQVTEPSSKNLWEQI 

CKEYEAEOPPFPEGYKVKOEPVITVAPVEEMLFH 

GFSAEHYFPVSHFTMISRTPCPQDKSETINPKTCS 

PKEYLETFIFPVLLPGMASLLHQAKKEKCFEVVL 

QMTPSGGKACVWGHLPSSSHTI 


3603 


A 


286 


587 


NISNKAEVSSHPSVISHSMDSFGQPRPEDNQSVLR 

RMQKKYAVKTKQVFIKATGKKEDEHLVASDAEL 

DAKLEVFHSVQETCTELLKIIEKYQLRLNGMKS 


3604 


A 


103 


2440 


QPRRRVFPAAGRGPGRKCSQWGRQASVSFEDVT 

VDFSKEEWQHLDPAQRRLYWDVTLENYSHLLS 

VGYQIPKSEAAFKLEQGEGPWMLEGEAPHQSCS 
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SEQlD 
NO: 


Method 


Predicted 

beginning 

naclcotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alaoine OCysteine, D=Aspartic Acid, 
E==Glutamic Add, F=FhenylalaDine, G=Glyclne,H==Histidine, . 
Hsoleuciae, K=Lysine, LpLeucine, IM^Mcthioniney 
N^Asparaglne, F^Prolinc, (Xvlntamine, RBArginine, S=Serine, 
T^Threoninc, V=VaIine, W^Tryptophan, Y«Tyrosine, 
X==Unluiown, *=^top codon^ /^possible nucleotide deletion, 
\-possible nucleotide insertion 










GEAIGKMQQQGIPGGFFHCERFDQPIGEDSLCSI 

LEELWQDNDQLEQRQENQNNLLSHVKVLIKERG 

YEHKNmKIfflVTTKLVPSIKM.HNCDTILKHTLN 

SHNHNRNSATKNLGKIFGNGNNFPHSPSSTKNEN 

AKTGANSCEHDHYEKHLSHKQAPTHHQKJHPEE 

KLYVCTECVMGFTQKSHLFEHQRIHAGEKSREC 

DKSNKVFPQKPQVDVHPSVYTGEKPYLCTQCGK 

VFTLKSNLITHQKIHTGQKPYKCSECGKAFFQRS 

DLFRHLRIHTGEKPYECSECGKGFSQNSDLSIHQ 

KTHTGEKHYECNECGKAKIKKSALRMHQRIHTG 

EKPYVCADCGKAFIQKSHFNTHQRIHTGEKPYEC 

SDCGKSFTKKSQLHVHQRIHTGEKPYICTECGKV 

FTHRTNLTTHQKTHTGEKPYMCAECGKAFTDQS 

NLIKHQKTHTGEKPYKCNGCGPCAFIWKSRLKIH 

OKSHIGERHYECKDCGKAFIOKSTLSVHORIHTG 

EKPYVCPECGKAFIQKSHFIAHHRIHTGEKPYECS 

DCGKCFTKKSQLRVHQKIHTGEKPNICAECGKAF 

TDRSNLITHQKIHTE^EKPYECGDCGKTFTWKSRL 

NIHOKSHTGERHYECSKCGKAFIOKATLSMHOII 

HTGKKPYACTECQKAFTDRSNLIKHQKMHSGEK 

RYKASD 


3605 


A 


3 


322 


SFRMSGRGKGGKGLGKGGAKRHRKVLRDNIQGI 
TKPAIRRLARRGGVKRISGLIYEETRGVLKVFLEN 
VIRDAVTYTEHAKRKTVTAMDVVYALKRQGRT 
LYGFGG 


3606 


A 

• 


1 


1749 


VPVTAEAKLMGFTQGCVTFEDVAIYFSQEEWGL 

LDEAQRLLYRDVMLENFALITALVCWHGMEDE 

ETPEQSVSVEGVPQVRTPEASPSTQKIQSCDMCV 

PFLTDILHLTDLPGQELYLTGACAVFHQDQKHHS 

AEKPLESDMDKASFVQCCLFHESGMPFTSSEVG 

KDFLAPLGILQPQAIANYEKPNKISKCEEAFHVGI 

SHYKWSQCRRESSHKHTFFHPRVCTGKRLYESS 

KCGKACCCECSLVQLQRVHPGERPYECSECGKS 

FSQTSHLNDHRRIHTGERPYVCGQCGKSFSQRAT 

LIKHHRVHTGERPYECGECGKSFSQSSNLIEHCRI 

HTGERPYECDECGKAFGSKSTLVRHQRTHTGEK 

PYECGECGKLFROSFSLVVHORIHTTARPYECGO 

CGKSFSLKCGLIQHQLIHSGARPFECDECGKSFSQ 

RTTLNKHHKVHTAERPYVCGECGKAFMFKSKL 

VRHQRTHTGERPFECSECGKFFRQSYTLVEHQKI 

HTGLRPYDCGQCGKSFIQKSSLIQHQWHTGERP 

YECGKCGKSFTQHSGLILHRKSHTVERPRDSSKC 

GKPYSPRSNIV 


3607 


A 


92 


331 


AMAGPGPGPGDPDEQYDFLFKLVLVGDASVGKT 
CWQRFKTGAFSERQGSTIGVDFTMKTLEIQGKR 
VKLQIWDTAGQER 


3608 


A 


545 


379 


AIKGYIHLSAPRNRYMHTTASKGRMLFMKVTM 

YMRRGVQIMGWSVRMAFMACFTQ 


3609 


A 


118 


873 


VWMAWQVSLLELEDRLQCPICLEVFKESLMLQC 

GHSYCKGCLVSLSYHLDTKVRCPMCWQWDGS 

SSLPNVSLAWVIEALRLPGDPEPKVCVHHRNPLS 

LFCEKDQELICGLCGLLGSHQHHPVTPVSTVCSR 

MKEELAALFSELKQEQKKVDELIAKLVKNRTRIV 

NESDVFSWVIRREFQELRHPVDEEKARCLEGIGG 

HTRGLVASLDMQLEQAQGTRERLAQAECVLEQF 
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SEQ ID 
NO: 


Method 


Predicted 

beginning 

nncieouQC 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
locanoD 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A«Alanine C=Cysteinc, D=>Aspartic Acid, 
E^lutamic Acid, F^Pbenylalanlne, G=Glycine, H^^Histidine, 
1— isoieuctne, K^^JLysiue, i^LiCucinef Jn=M.etiiioiiiDe, 
N^Asparagine, P^roline, Q=GlntaiDlne, RF^Arginine, S^erine, 
T=Threonine, V«Valinc,W«Tryptophan, Y=Tyrosine, 
X»Unknown, *^Stop codon, /-possible nucleotide deletion, 
V=possibIe nucleotide insertion 










GNEDHHEFIWKFHSMASR 


3610 


A 


2 

ff 


987 


DPRVRPPLLQPPPPLLPRLVILKMAPLDLDKYVEI 

ARLCKYLPENDLKRLCDYVCDLLLEESNVQPVS 

TPVTVCGDIHGQFYDLCELFRTGGQVPDTNYIFM 

GDFVDRGYYSLETFTYLLALKAKWPDRITLLRG 

NHESRQITQVYGFYDECQTKYGNANAWRYCTK 

VFDMLTVAALIDEQILCVHGGLSPDIKTLDQIRTI 

ERNQEIPHKGAFCDLVWSDPEDVDTWAISPRGA 

GWLFGAKVTNEFVHINNLKLICRAHQLVHEGYK 

FMFDEKLVTVWSAPNYCYRCGNIASIMVFKDVN 

TREPKLFRA VPDSERVIPPRni PYFL 


3611 


A 


2459 

• 


869 


AEKMTAELREAMALAPWGPVKVKKEEEEEENF 

PGQASSQQVHSENIKVWAPVQGLQTGLDGSEEE 

EKGQNISWDMAWLKATQEAPAASTLGSYSLPG 

TLAKSEILETHGTMNFLGAETKNLQLLVPK1EIC 

EEAEKPLnSERIQKADPQGPELGEACEKGNMLK 

RQRIKREKKDFRQVIVNDCHLPESFKEEENQKCK 

KSGGKYSLNSGAVKNPKTQLGQKPFTCSVCGKG 

FSQSANLWHQRIHTGEKPFECHECGKAFIQSAN 

LWHQRIHTGQKPYVCSKCGKAFTQSS^JLTVHQ 

KIHSLEKTFKCNECEKAFSYSSOLARHOKVHITE 

KCYECNECGKl'Fi'KSSNLIVHQRIHTGEKPFACN 

DCGKAFTQSANLIVHQRSHTGEKPYECKECGKA 

FSCFSHLIVHQRIHTAEKPYDCSECGKAFSQLSCL 

IVHQRIHSGDLPYVCNECGKAFTCSSYLLIHQRIH 

NGEKPYTCNECGKAFRQRSSLTVHQRTHTGEKP 

YECEKCGAAFISNSHLMRHHRTHLVE 


3612 


A 


318 


2245 

• 


SPMAEAALVNTPQIPMVTEEFVKPSQGHVTFEDI 

AVYFSQEEWGLLDEAQRCLYHDVMLENFSLMA 

SVGCLHGIEAEEAPSEQTLSAQGVSQARTPKLGP 

SIPNAHSCEMCDLVMKDILYLSEHQGTLPWQICPY 

TSVASGKWFSFGSNLQQHQNQDSGEKHIRKEESS 

ALLLNSCKIPLSDNLFPCKDVEKDFPTILGLLQHQ 

TTHSRQEYAHRSRETFQQRRYKCEQVFNEKVHV 

TEHQRVHTGEKAYKRREYGKSLNSKYLFVEHQR 

THNAEKPYVCNICGKSFLHKQTLVGHQQRIHTRE 

RSYVCIECGKSLSSKYSLVEHQRTHNGEKPYVCN 

VCGKSFRHKQTFVGHQQRIHTGERPYVCMECGK 

SFIHSYDRIRHQRVHTGEGAYQCSECGKSFIYKQ 

SLLDHHRIHTGERPYECKECGKAFIHKKRLLEHQ 

RIHTGEKPYVCIICGKSFIRSSDYMRHQRIHTGER 

AYECSDCGKAFISKQTLLKHHKIHTRERPYECSE 

CGKGFYLEVKLLQHQRIHTREQLCECNECGKVF 

SHQKRLLEHQKVHTGEKPCECSECGKCFRHRTS 

LIQHQKVHSGERPYNCTACEBCAFIYKNKLVEHQ 

RIHTGEKPYECGKCGKAFNKRYSLVRHQKVHIT 

EEP 


3613 


A 


817 


3345 


NQSHPDSETVTVEGGRRKMKSNQERSNECLPPK 

KREIPATSRSSEEKAPTLPSDNHRVEGTAWLPGN 

PGGRGHGGGRHGPAGTSVELGLQQGIGLHKALS 

TGLDYSPPSAPRSVPVATTLPAAYATPQPGTPVSP 

VQYAHLPHTFQnGSSQYSGTYASFIPSQLIPPTAN 

PVTSAVASAAGATITSQRSQLEAYSTLLANMGS 

LSQTPGHKAEQQQQQQQQQQQQQQQQQQQQQ 

QQQHQQQQQQQQQQQQQQHLSRAPGLITPGSPP 
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SEQUO 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'^AIanine C=Cystefne» D^Aspartic Add, 
E^lutamic Acid, F^Phenylalanine, G=Glycine, H^Histidine, 
I-Isoleucine, K^Lysine, L»Leucine, M^Methionlne, 
N»Asparagine, P^ProHnc, Q^Glutamine, RsArgioine, S^eriac, 
T=Threonine, V^Valine, W=Tryptoplian, Y«Tyroslne, 
X=:Unknown, *»Stop codon, /^possible nucleotide deletion, 
V^^sible nucleotide insertion 










PAQQNQYVHISSSPQNTGRTASPPAIPVHLHPHQ 

TMIPHTLTLGPPSQVVMQYADSGSHFVPREATK 

KAESSRLQQAIQAKEVLNGEMEKSRRYGAPSSA 

DLGLGKAGGKSVPHPYESRHVWHPSPSDYSSR. 

DPSGVRASVMVLPNSNTPAADLEVQQATHREAS 

PSTLNDKSGLHLGKPGHRSYALSPHTVIQTTHSA 

SEPLPVGLPATAFYAGTQPPVIGYLSGQQQAITY 

AGSLPQHLVIPGTQPLLIPVGSTDMEASGAAPAIV 

TSSPQFAAVPHTFVTTALPKSENFNPEALVTQAA 

YPAMVQAQIHLPWQSVASPAAAPPTLPPYFMK 

GSnOLANGELKKVEDLKTEDFIOSAEISNDLKIDS 

STVERJEDSHSPGVAVIQFAVGEHRAQVSVEVLY 

EYPFFVFGOGWSSCCPERTSOLFDLPCSKLSVGD 

VCISLTLKNLKNGSVKKGQPVDPASVLLKHSKA 

DGLAGSRHRYAEQENGINQGSAQMLSENGELKF 

PEKMGLSAAPFLTKIEPSKPAATRKRRWSAPESR 

KLEKSEDEPPLTLPKPSLIPQEVKICIEGRSNVGK 


3614 


A 


3 


114 


FFESRLRCKCCEPRGSWARFGCWRLQPEFKPKQ 
LEG 


3615 


A 


3 


1603 


DAWALTNQFSDSKQHIEVLKESLTAKEQRAAELQ 

TEVDALRLRLEEKETMLNKKTKQIQDMAEEKGT 

QAGEIHDLKDMLDVKERKVNVLQICKIENLQEQL 

RDKEKQMSSLKERVKSLQADTTNTDTALTTLEE 

ALAEKERTIERLBCEQRDRDEREKQEEIDNYKKDL 

KDLKEKVSLLQGDLSEKJEASLLDLKEHASSLASS 

GLKKDSRLKTLEIALEQKKEECLKMESQLKICAH 

EAALEARASPEMSDRIQHLEREITRYKDESSKAQ 

AEVDRLLEILKEVENEKNDKDKKIAELESLTSRQ 

VKDONKKVANLKHKEOVEKKKSAOMLEEARRR 

EDNLNDSSQQLQDSLRKKDDRIEELEEALRESVQ 

ITAEREMVLAQEESARTNAEKQVEELLMAMEKV 

KQELESMKAKLSSTQQSLAEKETHLTNLRAERR 

KHLEEVLEMKQEALLAAISEKDANIALLELSSSK 

KKTQEEVAALKREKDRLVQQLKQQTQNRMKLM 

ADNYEDDHFKSSHSNQTNHKPSPDQDEEEGIWA 


3616 


A 


244 


1420 


RRRWRARGGLVPTLAWAEATGAYVPGRDKPDL 

PTWKRNFRSALNRKEGLRLAEDRSKDPHDPHKI 

YEFVNSGVGDFSQPDTSPDTNGGGSTSDTQEDIL 

DELLGNMVLAPLPDPGPPSLAVAPEPCPQPLRSPS 

LDNPTPFPNLGPSENPLKRLLVPGEEWEFEVTAF 

YRGROVFOOTISCPEGLRLVGSEVGDRTLPGWP 

VTLPDPGMSLTDRGVMSYVRHVLSCLGGGLAL 

WRAGQWLWAQRLGHCHTYWAVSEELLPNSGH 

GPDGEVPKDKEGGVFDLGPFIVGSLGPPDLITFTE 

GSGRSPRYALWFCVGESWPQDQPWTKRLVMVK 

WPTCLRALVEMARVGGASSLENTVDLHISNSHP 

LSLTSDQYKAYLQDLVEGMDFQGPGES 


3617 


A 


852 


304 


RGGLLSKMARVLKAAAANAVGLFSRLQAPIPTV^ 

RASSTSQPLDQVTGSVWNLGRLNHVAIAVPDLE 

KAAAFYKNILGAQVSEAVPLPEHGVSVVFVNLG 

NTKMELLHPLGRDSPIAGFLQKNKAGGMHHICIE 

VDNINAAVMDLKKKKIRSLSEEVBaGAHGKPVTF 

LHPKDCGGVLVELEQA 


3618 


A 


3 


5992 


DNIDETYGVNVQFESDEEEGDEDVYGEVREEAS 
DDDMEGDEAVVRCTLSANMYVDEILVWCASEL 
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SEQU) 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A-Alanine C°Cysteine, D='Aspartic Acid, 
&=G]utaniic Acid, F-Fhenylalanine, G-GIydne, H°Hiatidine, 
I=Isoleiicine, K»Lysiney LBLeudoe, M-Methloninet 
N^Asparagine, P=Protine, Q=Glutaralne, R=Argininei S=Serine, 
T=Threonine, V=Vallne, W=Tryptophan, Y=Tyroslne, 
X»Unknown, *=^top codon, ^possible nucleotide deletion, 
\»possible nucleotide insertion 




• 






NIPEFFPLESPHKKVGYGLSSRTWLQGGGKVIEA 

GRDLLVASGELMSSKKKDLHPRDIDAFWLQRQL 

SRFYDDAIVSQKKADEVLEILKTASDDRECENQL 

VLLLGFNTFDFIKVLRQHRMMILYCTLLASAQSE 

AEKERIMGKMEADPELSKFLYQLHETEKEDLIRE 

ERSRRERVRQSRMDTDLETMDLDQGGEALAPRQ 

VLDLEDLVFTQGSHFMANKRCQLPDGSFRRQRK 

GYEEVHVPALKPKPFGSEEQLLPVEKLPKYAQA 

GFEGFKTLNRIQSKLYRAALETDENLLLCAPTGA 

GKTNVALMCMLREIGKHINMDGTINVDDFKJIYl 

APMRSLVQEMVGSFGKRLATYGITVAELTGDHQ 

LCKEEISATQIIVCTPEKWDIITRKGGERTYTQLV 

RLIILDEIHLLHDDRGPVLEALVARAIRNIEMTQE 

DVRLIGLSATLPNYEDVATFLRVDPAKGLFYFDN 

SFRPWLEQTYVGITEKKAIKRFQIMNEIVYEKIM 

EHAGKNQVLVFVHSRKETGKTARAIRDMCLEKD 

TLGLFLREGSASTEVLRTEAEQCKNLELKDLLPY 

GFAIHHAGMTRVDRTLVEDLFGDKHIQVLVSTA 

Tlj\WGVNLPAinVIIKGTQVYSPEKGRWTELGA 

LDILQMLGRAGRPQYDTKGEGILITSHGELQYYL 

SLLNQQLPESQMV^SKLPDMLNAEIVLGNVQNA 

KDAVNWLGYAYLYIRMLRSPTLYGISHDDLKGD 

PLLDQRRLDLVHTAALMLDKNNLVKYDKKTGN 

FQVTELGRIASHYYITNDTVQTYNQLLKPTLSEIE 

LFRVFSLSSEFKNITVREEEKLELQKLLERVPIPVK 

ESffiEPSAKINVLLQAFISQLKLEGFALMADMVY 

VTQSAGRLMRAIFEIVLNRGWAQLTDKTLNLCK 

MmKRMWQSMCPLRQFRKLPEEVVKKIEKKNFP 

FERLYDLNHNEIGELIRMPKMGKTIHKYVHLFPK 

LELSVHLQPITRSTLKVELTITPDFQWDEKVHGSS 

EAFWILVEDVDSEVILHHEYFLLKAKYAQDEHLI 

TFFVPVFEPLPPQYFIRWSDRWLSCETQLPVSFR 

HLELPEKYPPPTELLDLQPLPVSALRNSAFESLYQ 

DKFPFFNPIQTQVFNTVYNSDDNVFVGAPTGSGK 

TICAEFAE.RMLLQNSEGRCVYITPMRLWQEQVY 

MDWYEKFQDRLNKKVVLLTGETSTDLKLLGKG 

NTOSTPEKWDILSRRWKQRKNVQNINLFVVDEV 

HLIGGENGPVLEVICSRMRYISSQIERPIRrVALSSS 

LSNAKDVAHWLGCSATSTFNFHPNVRPVPLELHI 

QGFNISHTQTRLLSMAKPVFHAITKHSPKKPVIVF 

VPSRKQTRLTAmiLTTCAADIQRQRFLHCTEKJDL 

IPYLEKLSDSTLKETLLNGVGYLHEGLSPMERRL 

VEQLFSSGAIQVVVASRSLCWGMNVAAHLVUM 

DTLYYNGKIHAYVDYPIYDVLQMVGHANRPLQ 

DDEGRCVIMCQGSKKDFFKKFLYEPLPVESHLD 

HCMHDHFNAErVTKTDENKQDAVDYLTWTFLYR 

RMTQNPNYYNLQGISHRHLSDHLSELVEQTLSDL 

EQSKCISIEDEMDVAPLhn^GMIAAYYYimrrTIEL 

FSMSLNAKTKVRGLIEIISNAAEYENIPIRHHEDN 

LLRQLAQKVPHKLIWPKFNDPHVKTNLLLQAHL 

SRMQLSAELQSDTEEILSKAIRLIQACVDVLSSNG 

WLSPALAAMELAQMVTQAMWSEDSYLRRLPPF 

PSGLFKRCTDKGVESVFDIMEMEDEERNALLQLT 

DSQIADVARFCNRYPNIELSYEVVDKDSIRSGGP 

VWLVQLEREEEVTGPVIAPLFPQKREEGWWW 
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SCQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
eorrespondlng 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, D^Aspartic Acid, 
E^GlDtamlc Add, F«=Phenylalanine, G^iycine, H==Histidlne, 
Islsoleucine, KaLysine, L>»LeuciDe, M^Methioniae, 
N^Asparagine, P^ProUne, Q^Glatamine, R«Arginine, S=Serine, 
T=Threooine, V^'Vallne, W=Tryptophan, Y^Tyrosinc, 
X^Unknown, *'^top codon,/=posslble nucleotide deletion, 
\Bpossible nucleotide insertion 










IGDAKSNSLISIKRLTLQQKAKVKLDFVAPATGG 
RHNTLYFMSDAYMGCDQEYKFSVDVKEAETDS 
DSD 


3619 


A 


3 


5992 


DNIDETYGVNVQFESDEEEGDEDVYGEVREEAS 

DDDMEGDEAVVR(m.SANMYVDEILVWCASEL 

NIPEFFPLESPHKKVGYGLSSRTWLQGGGKVIEA 

GRDLLVASGELMSSKKKDLHPRDIDAFWLQRQL 

SRFYDDAIVSQKKADEVLEKLKTASDDRECENQL 

VLLLGFNTFDFIKVLRQHRMMELYCTLLASAQSE 

AEKERIMGKMEADPELSKFLYQLHETEKEDLIRE 

ERSRRERVRQSRMDTDLETMDLDQGGEALAPRQ 

VLJDLEDLVFTQGSHFMANKRCQLPDGSFRRQRK 

GYEEVHVPALKPKPFGSEEQLLPVEKLPKYAQA 

GFEGFKTLNRIQSKLYRAALETDENLLLCAPTGA 

GKT^rVALMCMLmGKHIN^fIXJTINVDDFKIIYI 

APMRSLVQEMVGSFGKRLATYGITVAELTGDHQ 

LCKEEISATQIIVCTPEKWDirrRKGGERTYTQLV 

RLIILDEIHLLHDDRGPVLEALVARAIRNIEMTQE 

DVFLIGLSATLPNYEDVATFLRVDPAKGLFYFDN 

SFRPWLEQTYVGITEKKAIKRFQIMNEIVYEKIM 

EHAGKNQVLVFVHSRKETGKTARAIRDMCLEKD 

TLGLFLREGSASTEVLRTEAEQCKNLELKDLLPY 

GFAIHHAGMTRVDRTLVEDLFGDKHIQVLVSTA 

TLAWGVNLPAHTVIIKGTQVYSPEKGRWTELGA 

LDELQMLGRAGRPQYDTKGEGILITSHGELQYYL 

SLLNQQLPIESQMVSKLPDMLNAEIVLGNVQNA 

KDAVNWLGYAYLYIRMLRSPTLYGISHDDLKGD 

PLLDQRRLDLVHTAALMLDK>JNLVKYDKKTGN 

FQVTELGRIASHYYITNTDTVQTYNQLLKPTLSEIE 

LFRVFSLSSEFKNITVREEEBCLELQKLLERVPIPVK 

ESIEEPSAKINVLLQAFISQLKLEGFALMADMVY 

VTQSAGRLMRAIFEIVLNRGWAQLTDKTLNLCK 

MTOKRMWQSMCPLRQFRKLPEEWKKIEKKNFP 

FERLYDLNHNEIGELIRMPKMGKTIHKYVHLFPK 

LELSVHLQPITRSTLKVELTITPDFQWDEKVHGSS 

EAFWILVEDVDSEVILHHEYFLLKAKYAQDEHLI 

TFFVPVFEPLPPQYFIRVVSDRWLSCETQLPVSFR 

HLE-PEKYPPPTELLDLQPLPVSALRNSAFESLYQ 

DKFPFFNPIQTQVTNTVYNSDDNVFVGAPTGSGK 

TICAEFAILRMLLQNSEGRCVYITPMRLWQEQVY 

MDWYEKFQDRLNKKVVLLTGETSTDLKLLGKG 

NniSTPEKWDILSRRWKQRKWQNINLFVVDEV 

HLIGGENGPVLEVICSRMRYISSQIERPIRIVALSSS 

LSNAKDVAHWLGCSATSTFNFHPNVRPVPLELHI 

QGFNISHTQTRLLSMAKPVFHAITKHSPKKPVIVF 

VPSRKQTRLTAIDILTTCAADIQRQRFLHCTEKDL 

IPYLEKLSDSTLKETLLNGVGYLHEGLSPMERE^L 

VEQLFSSGAIQVWASRSLCWGMNVAAHLVIIM 

DTLYYNGKIHAYVDYPIYDVLQMVGHANRPLQ 

DDEGRCVIMCQGSKiCDFFKKFLYEPLPVESHLD 

HCMHDHFNAEIVTKTIENKQDAVDYLTWTFLYR 

RMTQNPNYYNLQGISHRHLSDHLSELVEQTLSDL 

EQSKCISIEDEMDVAPLNLGMIAAYYYINYTTIEL 

FSMSLNAKTKVRGLIEnSNAAEYENIPIRHHEDN 

LLRQLAQKWHKLNNPKFNDPHVKTNLLLQAFIL 
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SEQW 
NO: 


Metbod 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alanine C=Cysteine, D^Aspartic Acid, 
E>=Glutamic Acid, F^Phenylalanine, G=GIycine, H==Histidine, 
I»Isoleucine, KpLysine, ]>=Leucine» M=^AlethiooiDe, 
N=A5paraginc, PsProline, Q^Glutamine, R-Arginine, S«SeriDe, 
T^Threonine, V^Valine, W«Tryptophan, Y=Tyrosinc, 
X"Unknown, *»Stop codon, /^possible nucleotide deletion, , 
\=spossible nucleotide Insertion 










SRMQLSAELQSDTEEILSKAIRLIQACVDVLSSNG 

WLSPALAAMELAQMVTQAMWSEDSYLRRLPPF 

PSGLFKRCTDKGVESVFDIMEMEDEERNALLQLT 

DSQIADVARFCNRYPNEELSYEVVDKDSIRSGGP 

VVX^VQLEREEEVTGPVIAPLFPQKREEGWWVV 

IGDAKSNSUSIKRLTLQQKAKVKLDFVAPATGG 

RHNTLYFMSDAYMGCDQEYKFSVDVKEAETDS 

DSD 


3620 


A 


1205 


323 


VIKMALAARLLPQFLHSRSLPCGAVRLRTPAVAE 
VRLPSATLCYFCRCRLGLGAALFPRSARALAASA 
LPAOGSRWPVLSSPGLPAAFASFPACPORSYSTE 

EKPQQHQKTKMIVLGFSNPINWVRTRIKAFLIWA 

YFDKEFSITEFSEGAKQAFAHVSKLLSQCKFDLL 

EELVAKEVLHALKEKVTSLPDNHKNALAANIDEI 

VFTSTGDISIYYDEKGRKFV2SIILMCFWYLTSANIP 

SETLRGASVFQVKLGNQNVETKQLLSASYEFQR 

EFTQGVKPDWTIARIEHSKLLE 


3621 


A 


2 

• 


2995 


SSSRSRHSSISPVRLPLNSSLGAELSRKKKERAAA 

AAAAKMDGKESSYERSGSYSGRSPSPYGRRRSSS 

PFLSKRSLSRSPLPSRKSMKSRSRSPAYSRHSSSH 

SKKKRSSSRSRHSSISPVRLPLNSSLGAELSRKKK 

ERAAAAAAAKMDGKESSYERSGSYSGRSPSPYG 

RRRSSSPFLSKRSLSRSPLPSRKSMKSRSRSPAYS 

RHSSSHSKKKRSSSRSRHSSISPVRLPLNSSLGAEL 

SRKKKERAAAAAAAKMDGKESKGSPVFLPRKE 

NSSVEAKDSGLESKKLPRSVKLEKSAPDTELVNV 

THLNTEVKNSSDTGKVKLDENSEKHLVKDLKAQ 

GTRDSKPIALKEEIVTPKETETSEKETPPPLPTIASP 

PPPLPl-i'lPPPQTPPLPPLPPIPALPQQPPLPPSQPA 

FSQVPASSTSTLPPSTHSKTSAVSSQANSQPPVQV 

SVKTQVSVTAAIPHLKTSTLPPLPLPPLLPGDDDM 

DSPKETLPSKPVKKEKEQRTRHLLTDLPLPPELPG 

GDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGER 

RQTESDWGKRCVDKFDIIGIIGEGTYGQVYKAKD 

KDTGELVALKKVRLDNEKEGFPITAIREIKILRQL 

IHRSVVNMKEIVTDKQDALDFKKDKGAFYLVFE 

YMDHDLMGLLESGLVHFSEDHIKSFMKQLMEGL 

EYCHKKNFLHRDIKCSNILLNNSGQDCLADFGLA 

RLYNSEESRPYTNKVITLWYRPPKLLLGEERYTP 

AIDVWSCGCILGELFTKKPIFQANLELAQLELISR 

LCGSPCPAVWPDVIKLPYFNTMKPKKQYRRRLR 

EEFSFIPSAALDLLDHMLTLDPSKRCTAEQTLQSD 

FLKDVELSKMAPPDLPHWQDCHELWSKKRRRQ 

ROSGVVVEEPPPSKTSRKETTSGTSTEPVKNSSPA 

PPQPAPGKVESGAGDAIGLADITQQLNQSELAVL 

LNLLQSQTDLSIPQMAQLLNIHSNPEMQQQLEAL 

NQSISALTEATSQQQDSETMAPEESLKEAPSAPVI 

LPSAEQTTLEASSTPADMONILAVLLSOLMKTOE 

PAGSLEENNSDKNSGPQGPRRTPTMPQEEAAGRS 

NGGNAL 


3622 


A 


16 


390 


TPERGSAYPETAAVRRPAGECPrrMSDLEAKLS'T' 
EHLGDKIKBEDIKLRVIGQDSSEIHFKVKMTrPLK 
KLKKSYCQRQGVPVNSLRFLFEGQRIADNHTPEE 
LGMEEEDVffiVYQEQIGGHSTV 


3623 


A 


2 


1544 


PPPAPGPDGLNEGCLHRLSMPHQRPRTCAMNPE 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=AIanine C=Cysteine, D-Aspartic Acid, 
£=Glutaniic Acid, F^Pbenylalanine, G^Glycine, H^'Histidine, 
I"Isoleucine, K^Lysine, I/=Leudne, M"Methionlne, 
N=>Asp8ragine, P^Prollne, Q^GIutamine, R«=Arginine, SoSerine, 
T=Threonine, V=Valinc, W=Tryptophan, Y^Tyrosine, 
X=Unknown» *==Stop codon, /=possible nucleotide deletion, 
V^possible nucleotide insertion 










LTMESLGTLHGARGGGSGGGGGGGGGGGGGGP 

GHEQELLASPSPHHARRGPRGSLRGPPPPPTAHQ 

ELGTAAAAAAAASRSAMVTSMASILDGGDYia>E 

LSIPLHHAMSMSCDSSPPGMGMSNTYTTLTPLQP 

LPPISTVSDKFHHPHPHHHPHHHHHHHHQRLSGN 

VSGSFTLMRDERGLPAMNmYSPYKEMPGMSQS 

LSPLAATPLGNGLGGLHNAQQSLPNYGPPGHDK 

MLSPNFDAHHTAMLTRGEQHLSRGLGTPPAAM 

MSHLNGLHHPGHTQSHGPVLAPSRERPPSSSSGS 

QVATSGQLEEINTKEVAQRITAELKRYSIPQAIFA 

QRVLCRSQGTLSDLLRNPKJPWSKLKSGRETFRR 

MWKWLQEPEFQRMSALRLAACKRKEQEPNKDR 

NNSQKKSRLVFTDLQRRTLFAIFKENKRPSKEMQ 

ITISQQLGLELTTVSNFFMNARRRSLEKWQDDLS 

TGGSSSTSSTCTKA 


3624 


A 

r 


27 


2152 


SARKAEAATSGTAARDGSVGRNLVPPPSASAPK 

AEVESNEKDNRPEEEEQVIHEDDERPSEKNEFSR 

RKRSKSEDMDNVQSKRRRYMEEEYEAEFQVKTT 

AKGDINQKLQKVIQWLLEEKLCALQCAVFDKTL 

AELKTRVEKIECmRHKTVLTELQAKIARLTKRF 

EAAKEDLKiCRHEHPPNPPVSPGKTVNDVNSNNN 

MSYRNAGTVRQMLESKRNVSESAPPSFQTPVNT 

VSSTNLVTPPAVVSSQPKLQTPVTSGSLTATSVLP 

APNTATWATTQVPSONPQPnSLQPLPVILHVPV 

AVSSQPQLLQSHPGTLVTNQPSGNVEFISVQSPPT 

VSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNP 

TASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVG 

PSGLYSPSTNRGPIQMKIPISAFSTSSAAEQNSNTT 

PRIENQTNKTIDASVSKKAADSTSQCGKATGSDS 

SGVIDLTMDDEESGASODPKKLNHTPVSTMSSSO 

PVSRPLQPIQPAPPLQPSGVPTSGPSQTTIHLLPTA 

PTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTT 

LPAPPAQAPLRGTVMQAPAVRQVNPQNSVTVRV 

PQTTTYVXnSINGLTLGSTGPQLTVHHRPPQVHTEP 

PRPVHPAPLPEAPQPQRLPPEAGSTSRPSEATLEV 

SHAFRVKMAIVLVMECPGGGSKLCHC 


3625 


A 


210 


1115 


ASPFLRPQGHDSGEREPFSQTPGLMQPFSIPVQIT 

LQGSRRRQGRTAFPASGKKRETDYSDGDPLDVH 

KJaPSSTGEDRAVMLGFAMMGFSVLMFFLLGTT 

ILKPFMLSIQREESTCTAIHTDIMDDWLDCAFTCG 

VHCHGQGKYPCLQVF.VNLSHPGQKALLHYNEE 

AVQINPKCFYTPKCHQDRNDLLNSALDDCEFFDH 

KNGTPFSCFYSPASQSEDVILIKKYDQMAIFHCLF 

WPSLTLLGGALIVGMVRLTQHLSLLCEKYSTVV 

RDEVGGKVPYIEQHQFKLCIMRRSKGRAEKS 


3626 


A 


9 


921 


SSVVEFSALSVSMACLSPSQLQKFQQDGFLVLEG 

FLSAEECVAMQQRIGEIVAEMDVPLHCRTEFSTQ 

EEEQLRAQGSTDYFLSSGDKIRFFFEKGVFDEKG 

NFLVPPEKSINKIGHALHAHDPVFKSITHSFKVQT 

LARSLGLQMPVVVQSMYIFKQPHFGGEVSPHQD 

ASFLYTEPLGRVLGVWIAVEDATLENGCLWFIPG 

SHTSGVSRRMVRAPVGSAPGTSFLGSEPARDNSL 

FVPTPVQRGALVLIHGEVVHKSKQNLSDRSRQA 

YTFHLMEASGTTWSPENWLQPTAELPFPQLYT 


3627 


A 


231 


644 


INSSPRTGRDHQELNLHTERDSRSQRAVLKIPRQ 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

actd residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A==Alanine C=Cystelne, ENAspartic Acid, 
E=Glutamic Acid, F^Phenylalanine, G^Gtycine, H=Histidine, 
I^Isoleucine, K-Lysine, I>Leacine, M^Methionine, 
N«Asparagine,P°Proline, Q=Glutamine, R«Arginine» S^Serlne, 
T^Tbreonine, V=Valine, W=Tryptophan, Y=Tyrosine, 
X^^Unlcnown, ^'^Stop codon, /-pos^ble nucleotide deletion, 
V=possible nucleotide insertion 










NPGIFYWIFLPSRSHSASHGSRQRQVSCQGTQDEI 
LKMRl^TFAELKNSLEALSSRMDQAEERIGTQAG 
VQWRDHGSLQPQPPEFKQCFHLSLPSSWDYRAC 
LS 


3628 


A 


2 


810 


GCKHLLQNSWYDPRVREADRVGQRARRPRAAM 

DWLMGKSKAKPNGKKPAAEERKAYLEPEHTKA 

RITDFOFKELVVLPREEDLNEWLASNTTTFFHHIN 

LQYSTISEFCTGETCQTMAVCNTQYYWYDERGK 

KVKCTAPQYVDFVMSSYQKLVTDEDVFPTKYG 

REFPSSFESLVRKICRHLFHVLAHIYWAHFKETLA 

LELHGHLNTLYVHFILFAREFNLLDPKETAIMDD 

LTEVLCSGGRRGSTVGAVGMGPAAGAPGAQNH 

VKER 


3629 


A 


699 


1604 

• 


CSHGSSAVSAWSPLFQASEVERQLSMQVHALRE 
DFREra^SSlNQHIIRLESLQAEIKMLSDRKRELEH 
RLSATLEENDLLOGTVEELODRVLILEROGHDKD 

LQLHQSQLELQEVRLSCRQLQVKVEELTEERSLQ 

SSAATSrSLLSEIEQSMEAEELEQEREQLTLLSVE 

MTALKEEBU)RLRVTSEDKEPKEQLQKAIRDRDE 

AIAKKNAVELELAKCRMDMMSLNSQLLDAIQQ 

KLNLSOOLEAWODDMHRVIDROLMDTHLKERS 

QPAAALCRGHSAGRGDEPSIAEGICRLFSFFRKI 


3630 


A 


423 


1 


PAKVLTLDIYLSKTEGAQVDEPVVITPRAEDCGD 

WDDMEKRSSGRRSGRRRGSQKSTDSPGADAELP 

ESAARDDAVFDDEVAPNAASDNASAEKKVKSPR 

AALDGGVASAASPESKPSPGTKGQLRGESDRSK 

QPPPASSP 


3631 


A 


2082 


674 


WSGFWQLPGVRGVGSAPGGDGAEFTSRRGSSRR 

PGAACPGCRGAGSERAPGGMGRRRAPELYRAPF 

PLYALQVDPSTGLLIAAGGGGAAKTGIKNGVHF 

LQLELINGRLSASLLHSHDTETRATMNLALAGDI 

LAAGQDAHCQLLRFQAHQQQGNKAEKAGSKEQ 

GPRQRKGAAPAEKKCGAETQHEGLELRVENLQA 

VQTDFSSDPLQKVVCFNHDNTLLATGGTDGYVR 

VWKVPSLEKVLEFKAHEGEIEDLALGPDGKLVT 

VGRDLKASVWOKDOLVTOLHWOENGPTFSSTP 

YRYQACRFGQVPDQPAGLRLFTVQIPHKRLRQPP 

PCYLTAWDGSNFLPLRTKSCGHEWSCLDVSES 

GTFLGLGTVTGSVAIYIAFSLQCLYYVREAHGIV 

VTDVAFLPEKGRGPELLGSHETALFSVAVDSRCQ 

LHLLPSRRSVPVWLLLLLCVGLIIVTILLLQSAFPG 

FL 


3632 


A 


942 


40 


PWCQRVEVRSCGSSKRSCSRWSGSSWDGSRSLG 

RGLNHTSLNRSPPFTPDTNITHCCSPCCQPTCCRT 

TCCRTTCWKPTTVTTCSSTPCCOPSCCVPSCCOP 

CCHPTCCQNTCCRTTCCQPTCVASCCQPSCCSTP 

CCQPTCCGSSCCGQTSCGSSCCQPICGSSCCQPCC 

HPTCYQTICFRTTCCQPTCCQPTCCRNTSCQPTCC 

GSSCCQPCCHPTCCQTICRSTCCQPSCVTRCCSTP 

CCQPTCGGSSCCSQTCNESSYCLPCCRPTCCQTT 

CYRTTCCRPSCCCSPCCVSSCCQPSCC 


3633 


A 


605 


3004 


GPEGYRGRRARHPSLGSTTGHCGGGRGAEGTGT 
DPAAPAARLNVDGLLVYFPYDYIYPEQFSYMRE 
LKRTLDAKGHGVLEMPSGTGKTVSLLALIMAYQ 
RAYPLEVTKLIYCSRTVPEIEKVIEELRKLLNFYE 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

seqaence 


Predicted end 
nacieotide 
locatioo 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteine» D=>Aspartic Acid, 
E^lutamic Acid, F=Phenyialanine, G^Iycine, H=Hlsndine» 
l^lsoleucine, K»Lysine^ JL»Leudne, M«Methioniae, 
N^Asparagine, P»Pro)ine, Q=Glatamine, R=Arginine, S«5erine, 
T=Threoninc, V=Valine, W=Tryptophan, Y^Tyrosine, 
X=Unl(nown, *B^top codon, /possible nucleotide deletion, 
V=:possible nucleotide insertion 










KQEGEKLPFLGLALSSRKNLCIHPEVTPLRFGKD 

VDGKCHSLTASYVRAQYQHDTSLPHCRFYEEFD 

AHGREVPLPAGIYNLDDLKALGRRQGWCPYFLA 

RYSILHANWVYSYHYLLDPKIADLVSKELARK 

AWVFDEAHNDDNVCIDSMSVNLTRRTLDRCQG 

NLBTLQKTVLRIKETDEQRLRDEYRRLVEGLREA 

SAARETDAHLANPVLPDEVLQEAVPGSIRTAEHF 

LGFLRRLLEYVKWRLRVQHWQESPPAFLSGLA 

QRVCIQRKPLRFCAERLRSLLHTLEITDLADFSPL 

TLLANFATLVSTYAKGFTniEPFDDRTPTIANPIL 

HFSCMDASLAIKPVFERPQSVIITSGTLSPLDIYPK 

ILDFHPVTMATFTMTLARVCLCPMIIGRGNDQVA 

ISSKFETREDIAVIRNYGKLLLEMSAVVPDGIVAF 

FTSYQYMESTVASWYEQGELENIQRNKLLFIETQ 

DGAETSVALEKYQEACENGRGAILLSVARGKVS 

EGIDFVHHYGRAVIMFGVPYVYTQSRILKARLEY 

LRDQFQIRENDFLTFDAMRHAAQCVGRAIRGKT 

DYGLMVFADKRFARGDKRGKLPRWIQEHLTDA 

NLNLTVDEGVQVAKYFLRQMAQPFHREDQLGL 

SLLSLEQLESEETLKRIEQIAQQL 


3634 


A 


159 


384 


LKMSSKTASTNNIAQARRTVQQLRLEASIERIKV 
SKASADLMSYCEEHARSDPLLIGIPTSENPFKDKK 

TCIIL ' 


3635 


A 


5 


409 


TELSQLEKAHPPADMGIIRKSKEKPPPKKKMTO^ 
LETQFTCPFCNHEKSCDVKMDRAROTGVISCTV 
CLEEFQTPITCILGNLGFFQRVGRGLESGPCSSGP 
LCALVQGQSRPEEQVPPSDFCGVRRCRAGFQCQ 


3636 


A 


48 


282 


DHLKSCYQDSHEDPTKMKRFLFLLLTISLLVMVQ 
IQTGLSGQNDTSQTSSPSASSSMSGGIFLFFVANAI 
IHLFCFS 


3637 


A 


1 


1248 


ARAGSVVGSAAARGPPAGCRCERAARLPSSPAR 

RRRCDWVEDGAGRMEILMTVSKFASICTMGAN 

ASALEKEIGPEQFPVNEHYFGLVNFGNTCYCNSV 

LQALYFCRPFREKGLAYKSQPRKKESLLTCLADL 

FHSIATQKKKVGVIPPKKFITRLRKENELFDNYM 

QQDAHEFLNYLLNTIADILQEERKQEKQNGRLPN 

GMDNENNNSTPDPTWVHEIFQGTLTNETRCLTC 

ETISSKDEDFLDLSVDVEQNTSITHCLRGFSNTET 

LCSEYKYYCEECRSKQEAHKRMKVKKLPMILAL 

HLKRFKYMDQLHRYTKLSYRVVFPLELRLFNTS 

GDATNPDRMYDLVAVVVHCGSGPNRGHYIAIV 

KSHDFWLLFDDDIVEKIDAQAIEEFYGLTSDISKN 

SESGYILFYQSRD 


3638 


A 


11 


630 


PAGIPVSTISSDRRASTDLTRKMKPDETPMFDPNL 

LKEVDWSQNTATFSPAISPTHPGEGLVLRPLCTA 

DLNRGFFKVLGQLTETGVVSPEQFMKSFEHMKK 

SGDYYVTWEDVTLGQIVATATLIIEHKFIHSCAK 

RGRVEDWVSDECRGKQLGNLLLSTLTLLSKKL 

NCYKITLECLPQNVGFYXKFGYTVSEENYMCRR 

FLK 


3639 


A 


2 


1200 


PRVRLLRPSRSRSCRGLLSTRAPGPSPFRSLHSSPL 

LPHAMKSPFYRCQNTTSVEKGNSAVMGGVLFST 

GLLGNLLALGLLARSGLGWCSRRPLRPLPSVFY 

MLVCGLTVTDLLGKCLLSPVVLAAYAQNRSLRV 

LAPALDNSLCQAFAFFMSFFGLSSTLQLLAMALE 
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SEQlD 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (Ai^Alanine C=Cy5teine, D^Aspartic Acid, 
£=Glatamic Acid, r=PhenytaIamne, G=Glycine, H>Histidine, 
I^Isoleucine, K=Ly$ine, L^Leudne, M>=Methionine, 
N^Asparagine, P^Prdine, Q^GIutamlnc, R»Argiuinc, S»Serine, 
T«Tlireonine, V=Valine, W«Tryptophan, Y^Tyroslne, 
X-Unknown, ^'^Stop codon, /-possible nucleotide deletion, 
X'^possible nucleotide insertion 










CWLSLGHPFTYERHITLRLGALVAPVVSAFSLAF 

CALPFMGFGKFVQYCPGTWCFIQMVHEEGSLSV 

LGYS\a.YSSLMALLVLAT\n.Chn:.GAMRM.YAM 

HRRLQRHPRSCTRDCAEPRADGREASPQPLEELD 

HLLLLALMTVLFTMCSLPVIYRAYYGAFKDVKE 

KNRTSEEAEDLRALRFLSVISTVDPWIFUFRSPVFR 

IFFHKIFIRPLRYRSRCSNSTNMESSL 


3640 


A 


930 


182 


PLPPPTLAMFLTRSEYDRGVNTFSPEGRLFQVEY 

AIEAIKLGSTAIGIQTSEGVCLAVEKRITSPLMEPS 

SIEKIVEIDAHIGCAMSGLIADAKTLIDKARVETQ 

MHWKIYNETMTVESVTQAVSNLALQFGEEDADP 

GAMSRPFGVALLFGGVDEKGPQLFHMDPSGTFV 

QCDARAIGSASEGAQSSLQEVYHKSMTLKEAIKS 

SLIILKQVMEEKLNATNIELATVQPGQNFHMFTK 

EELEEVIKDl 


3641 


A 


2 


1254 


PTGQGGRRAEARSCLLSKAMLGRSGYRALPLGD 

FDRFQQSSFGFLGSQKGCLSPERGGVGTGADVPQ 

SWPSCLCHGLISFLGFLLLLVTFPISGWFALKIVPT 

YERMIVFRLGRIRTPQGPGMVLLLPFIDSFQRVDL 

RTRAFNVPPCKLASKDGAVLSVGADVQFRIWDP 

VLSVMTVKDLKTATRMTAQNAMTKALLKRPLR 

EIOMEKLKISDOLLLEINDVTRAWGLEVDRVELA 

VEAVLQPPQDSPAGPNLDSTLQQLALHFLGGSM 

NSMAGGAPSPGPADTVEMVSEVEPPAPQVGARS 

SPKQPLAEGLLTALQPFLSEALVSQVGACYQFNV 

VLPSGTQSAYFLDLTTGRGRVGHGVPDGIPDVV 

VEMAEADLRALLCRELRPLGAYMSGRLKVKGD 

LAMAMKLEAVLRALK 


3642 


A 


1 


237 


RRGEDDMATEGDVELELETETSGPERPPEKPRKH 

DSGAADLERVTDYAEEKEIQSSNLETAMSVIGDR 

RSREQKAKQER 


3643 


A 


94 


541 


RKERRRRRRRMEAVVFVFSLLDCCALIFLSVYFII 

TLSDLECDYINARSCCSKLNKWVIPELIGHTIVTV 

LLLMSLHWFIFLLNLPVATWNIYRYIMVPSGNM 

GVFDPTEIHNRGQLKSHMKEAMIKLGFHLLCFF 

MYLYSMILALIND 


3644 


A 


95 


2808 


TSCRHFPITSEDPLNYLLILTVERIYAYQALPLGFL 

FCSRDPVPEYLNHCGVKYVLISDRASFCALHIFFS 

PFRNVFRPAAGGGIAPPPRLWFQPSLSDAEMEIPK 

LLPARGTLQGGGGGGIPAGGGRVHRGPDSPAGQ 

VPTRRLLLPRGPQDGGPGRRREEASTASRGPGPS 

LFAPRPHQPSGGGGGGGDDFFLVLLDPVGGDVE 

TAGSGQAAGPVLREEAEEGPGLQGGESGANPAG 

PTALGPRCLSAVPTPAPISAPGPAAAFAGTVTIHN 

QDLLLRFENGVLTLATPPPHAWEPGAAPAQQPG 

CLIAPQAGFPHAAHPGDCPELPPDLLLAEPAEPAP 

APAPEEEAEGPAAALGPRGPLGSGPGVVLYLCPE 

ALCGQTFAKXHQLKNIHLLTHSSSQGQRPFKCPL 

GGCGWTFTTSYKLKRHLQSHDKLRPFGCPAEGC 

GKSFITVYNLKAHMKGHEQENSFKCEVCEESFP 

TQAKLGAHQRSHFEPERPYQCAFSGCKKTFITVS 

ALFSHNRAHFREQELFSCSFPGCSKQYDKACRLK 

IHLRSHTGERPFLCDFDGCGWNFTSMSKLLRHKR 

KHDDDRRFMCPVEGCGKSFTRAEHLKGHSITHL 

STKPFVCPVAGCCARFSARSSLYIHSKKHLQDVD 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue of 
peptide 
sequence 


Amino flcid sequence (A=Alaaine C^ysteine, D=Aspartic Acid, 
E«=Glutan]ic Acid, F^Phenylalanine, G=CIycinc, H==HistidiDe, 
I~Isoleucine, K^Lysine, L»Leucine, M»Methionine, 
N=Asparagine, P^Proline, <^Glntamlne, RsArginine, S«Serlne, 
T='Threonlne, V=VaIlne, W»Tryptophan, Y>=Tyr^ne, 
X-Unknown, *»Stop codon, A=:possible nucleotide deletion, 
\=pos5ible Ducleotide insertion 










TWKSRCPISSCNKLFTSKHSMKTHMVKRHKVGQ 

DLLAOLEAANSLTPS SELTSORONDLSDAEIVSLF 

SDVPDSTSAALLDTALVNSGILTIDVASVSSTLAG 

HLPANNNNSVGQAVDPPSLMATSDPPQSLDTSLF 

FGTAATGFQQSSLNMDEVSSVSVGPLGSLDSLA 

MKNSSPEPQALTPSSKLTVDTDTLTPSSTLCENSV 

SELLTPAKAEWSVHPNSDFFGOEGETOFGFPNAA 

GNHGSQKERIvfLITVTGSSFLV 


3645 


A 


2194 


1707 


TVSFHKTMASLKCSTVVCVICLEKPKYRCPACRV 

PYCSVVCFRKflKEQCNPETRPVEKKIRSALPTKr 

VKPVENKDDDDSIADFLNSDEEEDRVSLQNLKN 

LGESATLRSLLLNPHLRQLMVNLDQGEDKABCLM 

RAYMQEPLFVEFADCCLGIVEPSQNEES 


3646 


A 

■ 


85 


1948 


ERGGGKAAAAAAAAAAARALAASGQDPRPHPR 

APPWDDSGDDDEATTPADKSELHHTLKNLSLKL 

DDLSTCNDLIAKHGAALQRSLTELDGLKIPSESG 

EKLKWNERATLFRITSNAMINACRDFLELAEIHS 

RKWQRALQYEQEQRVHLEETIEQLAKQHNSLER 

AFHSAPGRPANPSKSFIEGSLLTPKGEDSEEDEDT 

EYFDAMEDSTSFITVITEAKEDSRKAEGSTGTSSA 

DWSSADNVLIX5ASLVPKGSSKVKRRVRIPNKPN 

YSLNLWSlMKNCIGRELSRffMPVNFNEPLSMLQ 

RLTEDLEYHHLLDKAVHCTSSVEQMCLVAAFSV 

SSYSTTVHRIAKPFNPMLGETFELDRLDDMGLRS 

LCEQVSHHPPSAAHYVFSKHGWSLWQEITISSKF 

ROKYTSTMPT nATWT PFOA^^rtTsTHVVWRK'^sTQTVR 
fvvjiv I ioiiyxi^i^\j/\ixTi^Cri7v^/\ovJi>rT i v wivivo i d 1 V n. 

NIIVGKLWIDQSGDIEIVNHKTNDRCQLKFLPYSY 

FSKEAARKVTGVVSDSQGKAHYVLSGSWDEQM 

ECSKVMHSSPSSPSSDGKQKTVYQTLSAKLLWK 

KYPLPENAENMYYFSELALTLNEHEEGVAPTDS 

RLRPDQRLMEKGRWDEANTEKQRLEEKQRLSR 

RRRLEACGPGSSCSSEE 




A 


46 


5007 


PTGDACVSTSCELASALSHLDASHLTENLPKAAS 

ELGQQPMTELDSSSDLJSSPGKXGAAHPDPSKrS 

VDTGQVSRPENPSQPASPRVTKCKARSPVRLPHE 

GSPSPGEKAAAPPDYSKTRSASETSTPHNTRRVA 

ALRGAGPGAEGMTPAGAVLPGDPLTSQEQRQGA 

PGNHSKALEMTGIHAPESSQEPSLLEGADSVSSR 

APQASLSMLPSTDNTKEACGHVSGHCCPGGSRE 

SPVTDIDSFIKELDASAARSPSSQTGDSGSQEGSA 

QGHPPAGAGGGSSCRAEPVPGGQTSSPRRAWAA 

GAPAYPQWASQPSVLDSIhnPDKHFTVNKNFLSN 

YSRNFSSFHEDSTSLSGLGDSTEPSLSSMYGDAE 

DSSSDPESLTEAPRASARDGWSPPRSRVSLHKED 

PSESEEEQIEICSTRGCPNPPSSPAHLPTQAAICPAS 

AKVLSLKYSTPRESVASPREKVACLPGSYTSGPD 

SSQPSSLLEMSSQEHETHADISTSQNttRPSCAEET 

TEVTSASSAMENSPLSKVARHFHSPPIILSSPNMV 

NGLEHDLLDDETLNOYETSINAAASLSSFSVDVP 

KNGESVLENLHISESQDLDDLLQKPKMIARRPIM 

AWFKEINKHNQGTHLRSKTEKEQPLMPARSPDS 

KIQMVSSSQKKGVTVPHSPPQPKTNLENKDLSKK 

SPAEMLLTNGQKAKCGPKLKRLSLKGKAKVNSE 

APAANAVKAGGTDHRKPLISPQTSHKTLSKAVS 

QRLHVADHEDPDRNTTAAPRSPQCVLESKPPLAT 
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SEQlO 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteine, D=:Aspartic Acid, 
E>=Glatamlc Add, f^Fhenylalanine, G^Glycine, H»Histidinc, 
I^lsoleucine, K^Lysine, L^Leudne, M=Methionine, 
N^Asparagine, P»Proline, Q^GIutamlne, R»Arginine, S^^erine, 
T-Threonine, V=Va«ne, W=Tryptophan, Y"Tyrosine, 
X"Unknown, *«Stop codon, /^possible nucleotide deletion, 
\ppos5ible nudeotide insertioa 










SGPLKPSVSDTSIRTFVSPLTSPKPVPEQGMWSEIF 

HMAVLSEPDRGCPTTPKSPKCRAEGRAPRADSG 

PVSPAASRNGMSVAGNRQSEPRLASHVAADTAQ 

PRPTGEKGGNIMASDRLERTNQLKIVEISAEAVSE 

TVCGNKPAESDRRGGCLAQGNCQEKSEIRLYRQ 

VAESSTSHPSSLPSHASQAEQEMSRSFSMAKLAS 

SSSSLQTAIRKAEYSQGKSSLMSDSRGVPRNSIPG 

GPSGEDHLYFTPRPATRTYSMPAQFSSHFGREGH 

PPHSLGRSRDSQVPVTSSVVPEAKASRGGLPSLA 

NGQGIYSVKPLLDTSRNLPATDEGDIISVQETSCL 

VTDKIKVTRRHYCYEQNWPHESTSFFSVKQRIKS 

FENLANADRPVAKSGASPFLSVSSKPPIGRRSSGS 

IVSGSLGHPGDAAARLLRRSLSSCSENQSEAGTL 

LPQMAKSPSIMTLTISRQNPPETSSKGSDSELKKS 

LGPLGIPTPTMTLASPVKRNKSSVRHTQPSPVSRS 

KLQELRALSMPDLDKLCSEDYSAGPSAVLFKTEL 

EITPRRSPGPPAGGVSCPEKGGNRACPGGSGPKT 

SAAETPSSASDTGEAAQDLPFRRSWSVNLDQLLV 

SAGDQQRLQSVLSSVGSKSTILTLIQEAKAQSENE 

FiSvPFTVi MnK"Pr5QnT mi^JVAriO'n^VTJPVQTTvw 
EtU V X V jbiN jxJsxtVJduirfsjr o v luw Eri^rwdi i v n 

RVFSQGAASQEGTMNRGDFLLSVNGASLAGLAH 

GNVLKVLHQAQLHKDALWIKKGNfDQPRPSAR 

QEPPTANGKGLLSRKTIPLEPGIGRSVAVHDALC 

VEVLKTSAGLGLSLDGGKSSVTGDGPLVIKRVY 

KGGAAEQAGIIEAGDEILAINGKPLVGLMHFDA 

WNIMKSVPEGPVQLLIRKHRNSS 


3648 


A 


337 


1564 


ksrlsvtlmpvqlsehpewnesmhslrisvgglp 

vlasmtkaadprfrprwkvvltffvgaailwll 

cshrpapgrppthnahnwrlgqapanwyndty 

plsppqrtpagiryriaviadldtesraqeentwf 

tylkkgyltfsdsgdkvavewdkdhgvleshl 

aekgrgmelsdlivfngklysvddrtgvVyqie 

VJ OrvrV V a VV V U^OJUrVji^Vj l V X-rJVvJ rWA.I_/ W J^/\ V K f tr.n^ 

lyvgglgkewttttgdvvnenpewvkvvgyk 

gsvdhenwvsnynalraaagiqppgylihesac 

wsdtlqrwfflprrasqerysekdderkganll 

lsaspdfgdiavshvgavvpthgfssfkfepntdd 

qiivalkseedsgrvasyimaftldgrfllpetki 

gsvkyegiefi 


3649 


A 


1 


775 


ptrpgsgsaggarvgsgefgvemaalaplpplpa 
ofksiohhlrtaoehdkrdpvvayycrlyamo 

tgmkidsktpecrkflsklmdqlealkkqlgdn 

eaitqeivgcahlenyalkmflyadnedragrf 

hknmiksfytasllidvitvfgeltdenvkhrky 

arwkatyihnclkngetpqagpvgieedndieen 

edagaaslptqptqpsssstydpsnmpsgnytgi 

qippgahapantpaevphstgvak 


3650 


A 


20 


963 


kmaatlgplgswqqwrrclsardgsrrlllll 

llgsgogpoovgagotfeylkrehslsb^pyoge 

aprpcflrdwelqvhfkihgqgkknlhgdglai 

wytkdrmqpgpvfgnmdkfvglgvfvdtypne 

ekqqervfpyisamvnngslsydherdgrptel 

ggctaivrnlhydtflviryvkrhltimmdidgk 

hewrdcievpgvrlprgyyfgtssitgdlsdnhd 

vislklfeltvertpeeeklhrdvflpsvdnmkl 
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SEQID 
NO: 


Method 


Predicted 
beginning 
nucleotide 

■U^HWUII 

corresponding 
to first amino 
acid residue or 
peptide 
sequence 


Predicted end 

nucleotide 

location 

coiTMnnndintr 

to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'^AIanine C==Cysteine, D^Aspartic Acid, 
EX^Iutamic Add, F^Phenylaianine, G=Glyclne, H=Histidine, 
I=IsoIeucine, K^Lysine, I^Leucioe, M»Mettiionine, 

T=Threoaine, V-Valine, W=Tryptophan, Y«Tyrosine, 
X»Unknown, *»Stop codon, /=posslble nucleotide deletion, 
VFpossible nucteotide insertion 










PEMTAPLPPLSGLALFUVFFSLVFSVFAIVIGIILY 
NKWQEQSRKRFY 


3651 


A 


1 


1218 


RSWAYVKKCKNNMCPNRGLHDGPEPCWLHHA 

AGTVSAVQARGLQPSQSRSRPRVPGLATALAYG 

PAHTPPLSRIGWAMQPPPPGPLGDCLRDWEDLQ 

QDFQNIQVSAAADAGSPPSRVSLAQGQGSGSPGC 

Ki'SLPAEAEGAAQELENQMKERQGLFFDMEAYL 

PKKNGLYLSLVLGNVNVTLLSKQAKFAYKDEYE 

KFKT YT TTTT TT TSFTPRFT T NSRVTDAAFNFT T VW 

yyctltiresilrnngsrikgwwvfhhyvstflsg 
vmltwpdglmyqkfrnqflsfsmyqsfvqflq 
yyyqsgclyrlralgerhtmdltvegfqswmw 
rvltfllpflffghfwqlfnaltlfnlaqdpqck: 

EWOVLMCGFPFLLLFLGNFFTTLRWHHICFHSO 
RHGSKKD 


3652 


A 

r 


640 


164 


VTTSCnPFAFGLGVRASERLAEIDMPYLLKYQPM 

MQTIGQKYCMDPAVIAGVLSRKSPGDKILVNMG 

DRTSMVQDPGSQAPTSWISESQVFQTTEVLTTRI 

TELQRRFPTWTPDQYLRGGLCAYSGGAGYVRSS 

QDLSCDFCNDVLARAKYLKRHGF 


3653 


A 


2 


909 


IVRRDWQEVSDIHLAMANCKMTKSIRFPALEHC 

YTGGEVVLPKDQEEWKRRTGLLLYENYGQSETG 

j-*iv*A 1 1 w vjjyiTkJAJrvjr ivivjis.r\ i irir i u v v^r xiivlcao v 

ENCIIVSMNTADPGSQGITHSLLLQVIDDKGSILPP 

NTEGNIGIRIKPVRPVSLFMCYEGDPEKTAKVEC 

GDF YNTGDRGKMDEEG YICFLG RSDDIINASG YR 

IGPAEVESALVEHPAVAESAVVGSPDPIRGEVVK 

AFIVLTPQFLSHDKDQLTKELQQHVKSVTAPYKY 

PRKVEFVSELPKTITGKIERKELRKKETGQM 


3654 


A 


9 


909 


IVRRDWQEVSDIHLAMANCICMTKSIRFPALEHC 

YTGGEVVLPKDQEEWKRRTGLLLYENYGQSETG 

T TPATYWOMI^TTf PfrFMnKATPPYnVOFHA/fPA^lV 
jL^xv^A 1 I w vjivijxiiva vjx^ivivjivri 1 ir I u V \^jr niviJiiAij v 

ENCIIVSMNTADPGSQGITHSLLLQVIDDKGSILPP 

NTEGNIGIRIKPVRPVSLFMCYEGDPEKTAKVEC 

GDFYNTGDRGKMDEEGYICFLGRSDDIINASGYR 

IGPAEVESALVEHPAVAESAVVGSPDPIRGEVVK 

AFIVLTPQFLSHDKDQLTKELQQHVKSVTAPYKY 

PRKVEFVSELPKTITGKIERKELRKKETGQM 


3655 


A 


2 


2364 


SPGPSLPESAESLDGSQEDKPRGSCAEPTFTDTG 

MVAHDWSRLKAKGVGQHDNAQNFGNQSFEEL 

RAACLRKGELFEDPLFPAEPSSLGFKDLGPNSKN 

VQNISWQRPKDIINNPLFIMDGISPTDICQGILGDC 

WLLAAIGSLTTCPKLLYRVVPRGQSFKKNYAGIF 

HFQIWQFGQWVNWVDDRLPTKNDKLVFVHST 

ERSEFWSALLEKAYAKLSGSYEALSGGSTP^GL 

EDFTGGVAQSFQLQRPPQNLLRLLRKAVERSSL 

MGCSffiVTSDSELESMTDKMLVRGHAYSVTGLQ 

DVHYRGKMETLIRVRNPWGRIEWNGAWSDSAR 

EWEEVASDIOMOLLHKTEDGEFWMSYODFLNN 

FTLLEICNLTPDTLSGDYKSYWHTTFYEGSWRTG 

SSAGGCRNHPGTFWTNPQFKISLPEGDDPEDDAE 

GNWVCTCLVALMQKNWRHARQQGAQLQTIGF 

VLYAVPKEFQNIQDVHLKKEFFTKYQDHGFSEIF 

TNSREVSSQLRLPPGEYIIIPSTFEPHRDADFLLRV 

FTEKHSESWELDEVNYAEQLQEEKVSEDDMDQ 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to flrst amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


/Vmino acid sequence (A=»AIanine C'=Cysteine, D^Aspartic Acid, 
E^GIutamic Acid, f^Plicnylalanine, G-GIyclne, H^^Histidine, 
I^Isoleucine, K^Lysine, L^Leucine, M^^Mettiionlne, 
N»Asparagine, P^^Proline, Q^Glutamine, R^Arginine, S=Serine, 
T»Tbreonlne, V«Vflline, W=Tryptophan, V^Tyrosine, 
X"Unknown, "^^Stop codon» /^sposslble nucleotide deletion, 
Vipossible nucleotide insertion 










DFLHLFKIVAGEGKEIGVYELORLLNRMAIKFKS 

FKTKGFGLDACRCMINLMDKDGSGKLGLLEFKI 

LWKKLKKWMDIFRECDQDHSGTLNSYEMRLVIE 

KAGIKLNNKVMQVLVARYADDDLIIDFDSFISCF 

LELKTMFTFFLTMDPKNTGHICLSLEQVLGEGW 

EGICRJAPACPSTPPPPSSDVPGPASCPRLFPPWDL 

LPVSTVAADDHVGIEAL 


3656 


A 


3 


174 


PLCTHYLLPELPEKSSRTSPRSRPGNMLSGDPHLP 
QPLCHCLDHCPCCFSGKRLVA 


3657 


A 


1 


444 


DTRSTYHNAHSLPTYYKSPAPCQMTYIKSPAPCQ 

TQTCYVQGASPCQSYYVQAPASGSTSQYCVTDP 

CSAPCSTSYCCLAPRTFGVSPLRRWIQRPQNCNT 

GSSGCCENSGSSGCCGSGGCGCSCGCGSSGCCCL 

GEPMKSRSPALL 


3658 


A 

• 


92 


1537 


SEAPVQPQPYTMTSFYSTSSCPLGCTMAPGARNV 

FVSPIDVGCQPVAEANAASMCLLANVAHANRVR 

VGSTPLGRPSLCLPPTSHTACPLPGTCHIPGNIGIC 

GAYGKNTLNGHEKETMKFLISIDRLANYLEKVRQ 

LEQENAELETTLLERSKCHESTVCPDYQSYFRTIE 

ELQQKILCSKAENARLA^QIDNAKLAADDFRIKL 

ESERSLHQLVEADKCGTQKLLDDATLAKADLEA 

QQESLKEEQLSLKSNHEQEVKILRSQLGEKFRIEL 

WFQAQSEGISLQAMSCSEELQCCQSEILELRCTV 

NALEVERQAQHTLKDCLQNSLCEAEDRYGTELA 

QMQSLISNLEEQLSEIRADLERQNQEYQVLLDVK 

ARLENEIATYRNLTPLOSLFHACLLYFLSKLWPC 

HR^SLWPWSQHGEMILKARVRRLRLVALGSG 

VPSPCPVFLQD 


3659 


A 


2 


402 


DLLQCLNQLYSASTEMSCQQSQQQCQPPPKCTP 
KCPPKCTPKCPPKCPPKCPPQYSAPCPPPVSSCCG 
SSSGGCCSSEGGGCCLSHHRPRQSLRRRPQSSSC 
CGSGSGQQSGGSSCCHSSGGSGCCHSSGGCC 


3660 


A 


26 


710 


CSAVEVKMAARTAFGAVCRRLWOGLGNFSVTsIT 

SKGNTAKNGGLLLSTNMKWVQFSNLHVDVPKD 

LTKPVVTISDEPDILYKRLSVLVKGHDKAVLDSY 

EYFAVLAAKELGISIKVHEPPRKIERFTLLQSVHI 

YKKHRVOYEMRTLYRCLELEHLTGSTADVYLEY 

IQRNLPEGVAMEVTKFCFFIFLDTIRTVTRTHQGA 

NLGNTIRRKRRKQVIKPQGGHFCLMLK 


3661 


A 


2 


370 


DVSVAASEPTVYRNPTKMSCQQNQQQCQPPPKC 

PIPKYPPKCPSKCASSCPPPISSCCGSSSGGCCSSG 

GCGCCSSEGGGCCLSHHRHHRSHCHRPKSSNCY 

GSGSGQQSGGSGCCSGGGCC 


3662 


A 


205 


1277 


RKSLPHPNPQKMLKKPLSAVTWLCIFIVAFVSHP 

AWLQKLSKHKTPAQPQLKAANCCEEVKELKAQ 

VANLSSLLSELNKKQERDWVSVVMQVMELESN 

SKRMESRLTDAESKYSEMNNQIDIMQLQAAQTV 

TOTSAGKETSPLRERGVPPHLOHCFYIPPDDFLGS 

PELEVFCDMETSGGGWniQRRKSGLVSFYRDW 

KQYKQGFGSIRGDFWLGNEHIHRLSRQPTRLRVE 

MEDWEGNLRYAEYSHFVLGNELNSYRLFLGNY 

TGNVGNDALQYHNNTAFSTKDKDNDNCLDKCA 

QLRKGGYWYNCCTDSNLNGVYYRLGEHNKHLD 

GITWYGWHGSTYSLKRVEMKIRPEDFKP 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C^ysteine, D=Aspartic Acid, 
EK»Iutamic Acid, F^Phenylalanine» G«Glycine, H=Histidine, 
I»Isoteucine, K=Lysine, l^Leucine, M-Methionine, 
N^Asparagine, F^ProIine, Q^GIutaminc, R«Arginine, S^Serine, 
T«Threonine, V=VaIinc, W=Tryptophan, Y-Tyroslne, 
X-Unknown, *=Stop codon,/=^05sible nucleotide deletion, 
\»po5sible nucleotide insertion 


3663 


A 


64 


1456 


LSSAKETLAQMYNTVWNMEDLDLEYAKTDINC 

GTDLMFYmMDPPALPPKPPKPTTVA>nsrGMNNN 

MSLQDAEWYWGDISREEVNEKLRDTADGTFLV 

RDASTKMHGDYTLTLRKGGNNKLIKIFHRDGKY 

GFSDPLTFSSWELINHYRNESLAQYNPKLDVKL 

LYPVSKYQQDQWKEDNIEAVGKKLHEYNTQFQ 

EKSREYDRLYEEYTRTSQEIQMKRTAIEAFNETIK 

DFEEOCOTOERYSKEYIEKFKREGNEKEIORTMHN 

YDKLKSRISEIIDSRRRLEEDLKKQAAEYREIDKR 

MNSIKPDLIQLRKTRDQYLMWLTQKGVRQKKL 

NEWLGNENTEDQYSLVEDDEDLPHHDEKTWNV 

GSSNRNKAENLLRGKRDGTFLVRESSKQGCYAC 

SVVVDGEVKHCVINKTATGYGFAEPYNLYSSLK 

ELVLHYOHTSLVOHNDSLNVTLAYPVYAOORR 

will* V JLJIX A. X^XJ T ^^XXX^X^kJX^l^ v Jl XaJ^ X X W X £WJ\y^\^^ 


3664 


A 


944 


406 


GATVEDQSCNFGSLRWWSVPfflSARSCPDPLLS 

RTGRVPGGRGAGLPRHHSPRCCLQVFFNGANVR 

QVDVPTLTGAFGILAAHVPTLQVLRPGLWVHA 

EDGTTSKYFVSSGSIAVNADSSVQLLAEEAVTLD 

MLDLGAAKANLEKAQAELVGTADEATRAEIQIR 

lEANEALVKALE 


3665 


A 


98 


1388 


ASQLAFGGKLTSTPSRDFQGCGRGAVTCCSFHEH 

RHQSGRCLSTGMAPNLKGRPRKKKPCPQRRDSF 

SGVKDSNNNSDGKAVAKVKCEARSALTKPKMN 

HNCKKVSNEEKPKVAIGEECRADEQAFLVALYK 

YMKERKTPIERIPYLGFKQINLWTMFQAAQKLG 

GYETTTAFOIQWKHIYDELGGNPGSTSAATCTRR 

NENKTKVSGTBCfOKHEIPKSKKEKENAPKPQDAA 

EVSSEQEKEQETLISQKSIPEPLPAADMKKKIEGY 

QEFSAKPLASRVDPEKDNETDQGSNSEKVAEEA 

GEKGPTPPLPSAPLAPEKDSALVPGASKQPLTSPS 

ALVDSKQESKLCCFTESPESEPQEASFPRLPHHTG 

HRWQTRMRRRMTNCPPWQITLPTAP 


3666 


A 


113 


1492 


LLQEMCTKTIPVLWGCFLLWNLYVSSSQTIYPGI 

KARTTQRALDYGVQAGMKMIEQMLKEKKLPDL 

SGSESLEFLKVDYVNYNFSNIKISAFSFPNTSLAF 

VPGVGDCALTNHGTANISTDWGFESPLFVLYNSF 

AEPMEKPILKNLNEMLCPIIASEVKALNANLSTLE 

>aTKIDNYTLLDYSLISSPEITENYLDLNLKGVFY 

PLENLTDPPFSPVPFVLPERSNSMLYIGUEYFFKS 

ASFAHFTAGVFNVTLSTEEISNHFVONSOGLG>A/ 

LSRIAEIYILSQPFMVRIMATEPPIINLQPGNFTLDI 
PASIMMLTOPKNSTVETrVSMDFVASTSVGLVIL 

gqrlvcslslnrfrlalpesnrsnievlrfenilss 
ilhfgvlplanaklqqgfplpnphkflfvnsdiev 
legfllistdlkyetsskoopsfhvweglnlisro 

wrgksap 


3667 


A 


1 


181 


frgrlgsgrngggsmnappafesfllfegekitin 
kdtkvpnaclftinkedhtlgniik 


3668 


A 


212 


431 


vageavpffpmmyseplbcpsylalvlwyflltg 
ycitkpevifkffiqgeepwilekgfpsqchpakyl 

WCLHD 


3669 


A 


458 


1056 


fsgvcfagiagsmatllhdavmnpaevvkqrlq 
mynsqhrsaiscirtvwrteglgafyrsyttqlt 
mnipfqsihfityeflqeqvnphrtynpqshiisgg 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteiney D=Aspartic Acid, 
E^GIntamic Acid, F»Pbenylalanine, G«Glycine, H^Histidlne, 
I=Isoleuciney K^Lysine, L^Leucine, M^Methionine, 
N^Asparagine, P^'Proline, Q=GlutainlDe, R=Arginine» S^Serine, 
T'Tlireonine, V=Vallnc, W=Tryptoplian, Y^Tyrosinc, 
X~Unkno\vn, *-Stop codon, /^possible nucleotide deletion, 
\apossible nucleotide insertion 










LAGALAAAATTPLDVCKTLLNTQENVALSLANIS 
GRLSGMANAFRTVYQLNGLAGYFKGIQARVIYQ 
MPSTAISWSVYEFFKYFLTKRQLENRAPY 


3670 


A 


145 


298 


RNPCPLTFLPSTLMVLLLSLTFFSALTFHSICOLRN 
TGVEVDIVFQRVSFL 


3671 


A 


3 


462 


ILKVAKKERTMSSLPVPYKLPVSLSVGSCVIIKGT 

PIHSFINDPQLQVDFYTDMDEDSDIAFRFRVHFG 

NHVVMNRREFGIWMLEETTDYVPFEDGKQFELC 

lYVHYNEYEIKVNGHTHLRALSHREPPSFVEDGC 

KCPRRYLPWTSVCVCN 


3672 


A 


1 


1028 


HYAKLGTRPRLKFMSSPSLSDLGKREPAAAADE 
RGTQQRRACANATWNSIHNGVIAVFQRKGLPDQ 
ELFSLNEGVRQLLKTELGSFFTEYLQNQLLTKGM 
VILRDKIRFYEGQBCLLDSLAETWDFFFSDVLPML 

ALARAHARVPPAIVQMLLVLQGVHESRGVTEDY 

LRLETLVQKVVSPYLGTYGLHSSEGPFTHSCILEK 

RLLRRSRSGDVLAKNPVVRSKSYNTPLLNPVQE 

HEAEGAAAGGTSIRKHSVSEMTSCPEPQGFSDPP 

GQGPTGTFRSSPAPHSGPCPSRLYPTTQPPEQGLD 

PTRS 


3673 


A 


2 


712 


RPPRVWYPELRELSAAAPRWSHRTAPGIMVFYF 

WFHVDKLSSAHVYLRLHKGENIEDIPKEVLMDC 

AHLVKANSIQGCKMNNVNVVYTPWSNLKKTAD 

MDVGQIGFHRQKDVKIVTVEKKVNEILNRLEKT 

KVERFPDLAAEKECRDREERNEKKAQIQEMKKR 

EKEEMKJCKitEMDELRSYSSLMKVENMSSNQDG 

NDSDEFM 


3674 


A 


2 


712 


RPPRVWYPELRELSAAAPRWSHRTAPGIMVFYF 

WFHVDKLSSAHVYLRLHKGENEEDIPKEVLMDC 

AHLVKANSIQGCKMNNVNVVYTPWSNLKKTAD 

MDVGQIGFHRQKDVKIVTVEKKVNEILNRLEKT 

KVTRFPDLAAEKECRDREERNEKKAOIOEIVIKKR 

EKJEEMKKKREMDELRSYSSLMKVENMSSNQDG 

NDSDEFM 


3675 


A 


921 


1321 


VTLAKMRVfflSSCLKVQEQMANCPKFVPWPTS 
QPIPSNIPNRSTFACPYCGARNLDQQELVKHCVE 
SHRSDPNRVVCPICSAMPWGDPSYKSANFLQHL 
LHRHKFSYDTFVDYSIDEEAAFQAALALSLSEN 


3676 


A 


3 


1856 


TLGRWLLGVYETVAPTLACLPRPRLRRJRRRRRR 

RRNnSRYTRXAWQSLELKGITKHALNHHPPPEK 

LEEISPTSDSHEKDTSSQSKSDITRESSFTSADTGN 

SLSAFPSYTGAGISTEGSSDFSWGYGELDQNATE 

KVQTMFTAIDELLYEQKLSVHTKSLQEECQQWT 

ASFPHLRILGRQIITPSEGYRLYPRSPSAVSASYET 

TLSQERDSTIFGIRGKKLHFSSSYAHKASSIAKSSS 

FCSMERDEEDSIIVSEGIIEEYLAFDHIDIEEGFHG 

KKSEAATEKQKLGYPPIAPFYCMKEDVLAYVFD 

SVWCKWSCMEQLTRSHWEGFASDDESNVAVT 

RPDSESSCVLSELHPLVLPRVPQSKVLYITSNPMS 

LCQASRHQPNVNDLLVHGMPLQPRM-SLMDKLL 

DLDDKLLMRPGSSTILSTRKWPNRAVEFSTSSLS 

YTVQSrRRRNPPPRTLHPISTSHSCAETPRSVEEIL 
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SEQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A-Alanine C^Cysteine, D^Aspartic Add, 
£=GIutamic Acid, F^Fbenylalanine, G'=G!ycine, H«Histidine, 
l^IsoJeucjoe, K-Lysine, JL^Leudne, M=Metbioninc,* 
NsAsparagine, P»FroHnc, Q=Glutamine, R»Arginine, S^^erine, 
T=Thrconlne, V=VaIine, W=Tryptophan, Y=Tyrosine, 
X=Unknown, *'^top CDdon> /=possible nucleotide deletion, 
V=possibIe nucleotide insertion 










RGARVPVAPDSLSSPSPTPLSRNNLLPPIGTAEVE 

HVSTVGPQRQMKPHGDSSRAQSAWDEPNYQQ 

PQERLLLPDFFPRPNTTQSFLLDTQYRRSCAVEYP 

HQARPGRGSAGPQLHGSTKSQSGGRPVSRTRQG 

P 


3677 


A 


246 


757 


MRLQGAIFVLLPHLGPILVWLFTRDHMSGWCEG 

PRMLSWCPFYKVLLLVQTAIYSVVGYASYLVWK 

DLGGGLGWPLALPLGLYAVQLTISWTVLVLFFT 

VHNPGLALLHLLLLYGLVVSTALIWHPINKLAAL 

LLLPYLAWLTVTSALTYHLWRDSLCPVHQPQPT 

EKSD 


3678 


A 


20 


1508 


RGKAEFFLAMAGTNALLMLENFIDGKFLPCSSYI 

DSYDPSTGEVYCRVPNSGKDEIEAAVKAAREAFP 

SWSSRSPQERSRVLNQVADLLEQSLEEFAQAESK 

DQGKTLALARTMDIPRSVQNFRFFASSSLHHTSE 

CTQMDHLGCMHYTVRAPVGVAGLISPWNLPLY 

LLTWKIAPAMAAGNTVIAKPSELTSVTAWMLCK 

LLDKAGVPPGWNIVFGTGPRVGEALVSHPEVPL 

ISFTGSQPTAERITQLSAPHCKKLSLELGGKNPAn 

SIYSEFLKRFVEATRKWKVGIPSDPLVSIGALISK 

AHLEKVRSYVKRALAEGAQIWCGEGVDKLSLPA 

KNQAGYFMLPTVITDIKDESCCMTEEIFGPVTCV 

VPFDSEEEVIERANNVKYGLAATVWSSNVGRVH 

RVAKKLQSGLVWTNCWLIRELNLPFGGMKSSGI 

GREGAKDSYDFFTEIKTITVKH 


3679 


A 

* 


1862 


502 


MAGTKPYMEIQTTIREYYEHLYANKLENLEEMD 

KFLDTYTLPRLNQEEVESLNRPITGSEIEAIINSLP 

TKXIPGPDRFTAKFYQRYKEELSNLIHYLGLSHH 

LLALNFIIVSFGKKSAWSSAQVKVTDTDFDGVEV 

RVFEGPPKPEEPLKRSVVYIHGGGWALASAKIRY 

YDELCTAMAEELNAVIVSIEYRLVPKVYFPEQIH 

DVVRATKYFLKPEVLQKYMVDPGRJCISGDSAG 

GNLAAALGOOFTODASLKNKLKLOALIYPVLOA 

LDFNTPSYQQNVNTPILPRYVMVKYWVDYFKG 

NYDFVQAMVNNHTSLDVEEAAAVRARLNWTS 

LLPASFTKNYKPVVQ1TGNARIVQELPQLLDARS 

APLIADQAVLQLLPKTYILTCEHDVLRDDGIMYA 

KRLESAGVEVTLDHFEDGFHGCMIFTSWPTNFSV 

GIRTRNSYIKWLDQNL 


3680 


A 


249 


2146 


RSWGAPWFWRMRLLRRRHMPLRLAMVGCAFV 

LFLFLLHRDVSSREEATEKPWLKSLVSRKDHVLD 

L^ILEAMNNLRDSMPKLQIRAPEAQQTLFSINQSC 

LPGFYTPAELKPFWERPPQDPNAPGADGKAFQK 

SKWTPLETQEKEEGYKKHCFNAFASDRISLQRSL 

GPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 

TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKE 

KLEQYVKQLQVVRVVRQEERKGLITARLLGASV 

AQAEVLTFLDAHCECFHGWLEPLLARIAEDKTV 

WSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWS 

LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSI 

SKSYFEHIGTYDNQMEIWGGENVEMSFRVWQC 

GGQLEIIPCSVVGHVFRTKSPHTFPKGTSVIARNQ 

VRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFG 

DISERLQLREQLHCHNFSWYLHNVYPEMFVPDL 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to First amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'^Alanine C'=Cystcine> D-Aspartic Acid, 
E^lutamic Acid, F«Phenylalanine, G=Glycine, H»Histidine, 
I=IsoJeDcinef K^Lysine, LsLeudne, M=MetiiiDiiine, 
N=Asparagine, P^FroUne, Q^Glutamine, R»ArginiRe» S«Serine, 
T«Tbreonine, V-Valine, W-Tryptophan, Y»Tyrosine, 
X«Unknown, *=Stopcodon, /-possible nucleotide deletion, 
V^possible nncleotidc insertion 










TPTFYGAIKNLGTNQCLDVGENNRGGKPLIMYS 
CHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 
ALGLGSCHFTGKNSQVPKDEEWELAQDQLIRl^S 
GSGTCLTSQDKKPAMAPCNPSDPHQLWLFV 


3681 


A 


2982 


1869 

■ 


LKDTLKSQMTQEASDEAEDMKEAMNRMIDELN 

KQVSELSQLYKEAQAELEDYRKRKSLEDVTAEY 

IHKAEHEKLMQLTNVSRAKAEDALSENIKSQYSK 

VLNELTQLKQLVDAQKENSVSrrEHLQVITTLRT 

AAKEMEEKISNLKEHLASKEVEVAKLEKQLLEE 

KAAMTDAMVPRSSYEKLOSSLESEVSVLASKLFC 

ESVKEKEKVHSEVVQmSEVSQVKREKENIQTLL 

KSKEQEVNELLQKFQQAQEELAEMKRYSESSSK 

LEEDKDKKINEMSKEVTKLKEALNSLSQLSYSTS 

SSKRQSQQLEALQQQVKQLQNQLAECKKQHQE 

VISVYRMHLLYAVQGQMDEDVQKVLKQILTMC 

KNOSOKK 


3682 


A 


447 


1024 


AQALTAGRQLALAAPFIAPISPISLPRLNPPSQSW 

NSTPFFKVKLPPQKEVITSDELMAHLGNCLLSIKP 

QEKSEGLQLNFQQNVDDANfTVLPKLATGLDVN 

VRFTGVSDFEYTPECSVFDLLGIPLYHGWLVDPQ 

QSPEAVRAVGKLSYNQiyVGEDHHLQrLQ*HQP 

RDRKPDCRAVPGDHRGPSDLPRTV 


3683 


A 


2 


942 


LEIKQEEKI^GQCIKEELMHGECVKEEKDFLKKE 
IVDDTKVKEEPPINHPVGCKRKLAMSRCETCGTE 
EAKYRCPRCMRYSCSLPCVKKHKAELTCNGVRD 
KTAYTSTOOFTEMNLLSnYRFLFnVARTAnHKR 

DAFLKRPISNKYMYFMKNRARRQGINLKLLPMG 

FTKRKENSTFFDKKKQQFCWHVKLQFPQSQAVSr 

♦KKRVPDDKTINEILKPYIDPEKSDPVIRQRLKAYI 

RSQTGVQILMKIEYMQQNLVRYYELDPYKSLLD 

NLRNKVIIEYPTLHWLKGS^fNDMKVLHQYKSE 

STKNVGNEN 


3684 


A 


119 


1533 


SLQENVQEKRVRVCPGLGGLLPNGTPSITAAAAP 

QVLWRHVQPGCSHHLHACVIRAACRAGEGHAD 

RHAGPPET/PVTLPSSAVPWSSPWERQCPMH\L*AP 

GHAFRPVPTEHRRGWAALGHHRAAAGPLREPAS 

GSQPAPASC*PECHHGCPEQTRQCQDLLREAW 

APEQRG*PCAHLQT*ATATTLCPQVPAGRVWQP 

GHSCHLLPHRHDGSH*HHCAAHRRPVTRRQAAH 

GVPLPDACYSPHHTLPAAPPPATRPAGHTATHPE 

♦GGDLTPVPDGPHDCPRD VOG IPG AGGGSOLAPC 

CPPFPAAPVS VQGTQGLGPKNVLH* Q WEGIRWQ 

KEPE/PGPPPEVELKRGAKCRIGDHGLGAVLGQG 

EYAS*SPSIPW*ASSSACPPLHPTP/TVYTQSPAAA 

PGWTRPPSP/PPPGLYPGP/PASHAPGVRGGISHOL 

YSLP*LCRECCSCP/PPPPAHGGRCPSLLPPEALAK 

LLL 


3685 


A 


101 


438 


AWVLQCKINTELQTEVVMLKSMVLWLGEQVQS 
LQLQQQLHCHFNHTHICVTNLEYN\KEYPWDLV 
KAHLQGASTSNITFDIGELQKKMLDLNKQTQEFQ 
PSL*AWTEFQQGLE 


3686 


A 


105 


845 


VSDWKNQLVEVQCRQDGCDAVENVHQMFMF 
NWFTDCLWTLFLSNYQPSVESSSPGGSATSDDHE 
FDPSADMLVHDFDDERTLEEEEMMEGETNFS SEI 
EDLAREGDMPIHELLSLYGYGSTVRLPEEDEEEE 
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SEQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A==Alaninc C=Cystcine, D^Aspartic Acid, 
E<=Glutamic Add, F=Phenylalanine, G=Glycine, H^^Histidine, 
i^Isoleucine, K<°Lysine, L=Leucine, M^Methionine, 
N-Asparagine, P»ProUnc, Q=GlutBinine, R^Arginine, S=Serine, 
T^Threonine, V=Vallnc, W=Tryptophan, Y»Tyrosine, 
X=Unl(nown, *°Stop codon, /^possible nudeotide deletion, 
\-possible oudeotide Insertion 










EEEEEGEDDEDADNDDNSGCSGENKEEMKDSS 
GQEDETQSSNDDPSQSVASQDAQEURPRRCKYF 
DTNSEVEEESEEDEDYIP/SnSFFQSSDGI*SSSSSE 
DWKKEIMVGS 


3687 


A 


49 


1225 


PVLVTSLRMREADTLRPPQLMEVSADIISTVEFN 

HTGELLATGDKGGRWIFQREPESKNAPHSQGE 

YDVYSTFQSHEPEFDYLKSLEffiEKINKIKWLPQQ 

NAAHSLLSTNDKTIKLWKITERDKllPEGYNLKDE 

EGKLKDLSTVTSLQVPVLKPMDLMVEVSPRRIFA 

NGHTYHINSISVNSDCETYMSADDLRINLWHLAI 

TDRSFTP\NIVDIKPANMEDLTEVITASEFHPHHC 

NLFVYSSSKGSLRLCDMRAAALCDKHSKLFEEPE 

DPSNRSFFSEnS\SVSDVKFSHSDRYMLTR\DyLT 

VKVWDLMvlEARPIETYOVHDYLRSKLCSLYEND 

CIFDKFECAWNGSDR/IIMTGAYNNFFRMFDRNT 

BCRDVTLEASRGSSKPRAVL 


3688 


A 


1 


401 


KKVPGRLSEMSFSLNFTLPANTTSSPV1\DCGPSL 
GLAAGIPLLVATALLVALLFTLIHRRRSSIEAMEE 
SDRPCEISEIDDNPKISENPRRSPTHEKNTMGAQE 
AHIYVKTVAGSEEPVHDRYRPTIEMERRR 


3689 


A 


698 


889 


GRVLVHCAMGVSRSATLVLAFLMIYENMTLVEA 
IPDGAGPPQISALTX3AFVRQLQVLDNRLGRE 


3690 


A 


61 


153 


MGAHLVRRYLGDASVEPDPLQMPTFPPDYGF 


3691 


A 


61 


153 


MGAHLVRRYLGDASVEPDPLQMPTFPPDYGF 


3692 


A 


3 


2831 


PLVRRLLRQTLRRVGGARAVREAVMRAVLTWR 

DKAEHCINDIAFKPDGTQLILAAGSRLLVYDTSD 

GTLLQPLKGHKDTVYCVAYAKDGKRFASGSAD 

KSVUWSKLEGILKYTHNDAIQCVSYNPITHQLA 

SCSSSDFGLWSPEQKSVSKHKSSSKIICCSWTNDG 

QYLALGMFNGIISIRNKNGEEKVKIERPGGSLSPI 

WSICWNPSSRWESFWMNRENEDAEDVIVNRYIQ 

EIPSTLKSAVYSSQGSEAEEEEPEEEDDSPRDDNL 

EERNDILAVADWGVQKVSFYQLSGKQIGKDRAL 

NFDPCCISYFTKGEYILLGGSDKQVSLFTKDGVR 

LGTVGEQNSWVWTGQAKPDSNYVVGGCQDGn 

SFYQLIFSTVHGLYKDRYAYRDSMTDVIVQHLIT 

EQKVRIKCKELVKKIAIYRNRLAIQLPEKILIYELY 

SEDLSDMHYRVKEKIIKKFECNLLVVCANHIILC 

QEKRLQCLSFSGVKEREWQMESLIRYIKVIGGPP 

GREGLLVGLBCNGQILKIFVDNLFAIVLLKQATAV 

RCLDMSASRKKLAVVDENDTCLVYDIDTKELLF 

QEPNANSVAWNTQCEDMLCFSGGGYLNIKASTF 

PVHRQKLQGFVVGYNGSKIFCLHVFSISAVBVPQ 

SAPMYQYLDRKLFKEAYQIACLGVTDTDWRELA 

MEALEGLDFETAKKERKKRGETNNDLFLADVFS 

YQGKFHEAAKLYKRSGHENLALEMYTDLCMFE 

YAKDFLGSGDPKETKMLITKGADWARNIKEPKA 

AVEMYISAGEHVKAIEICGDHGWVDMLIDIARK 

LDKAEREPLLLCATYLKKLDSPGYAAETYLKMG 

DLKSLVQLHVETQRWDEAFALGEKHPEFKDDIY 

MPYAQWLAENDRFEEAQKAFHKAGRQREAVQV 

LEQLTNNAVAESRFNDAAYYYWMLSMQCLDIA 

QDPAQKD 


3693 


A 


3 


1099 


SSFPTCMRTVFHSNTSVSSLLHRPGHVTPQLTIHG 
GWRHHRDHTAIDEWDFNPSKFUYTCLLLFSYLL 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
locatioD 
eorresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=Cystelnef D»Aspartic Acid, 
E^Glutamic Acid, F^Phenylalanine, G=Glycine, H^Histidiae, 
I»Isoleucine, K-Lysine, I^Lcucine, M»Methionine, 
N'^Asparagine, P^Proline, Q=Glutamine, R»Arglnine, SaSerlne, 
T=TlireoDine, V=Valine, W=Tryptophan, Y«Tyroslne, 
X-Unknown> *=Stop codon, /^possible nucleotide deletion, 
V»possible nucleotide insertion 










PLRLDGUQWSYWAVFAPIWLWKLLWAGASVG 

AGVWARI^PRYRTEGEACVEFKAMLIAVGIHLLL 

LMFEVLVCDRVERGTHFWLLVFMPLFFVSPVSV 

AACVWGFRHDRSLELEILCSVNILQFIFIALKLDRJ 

IHWPWLWFVPLWILMSFLCLWLYYIVWSLLFL 

RSLDVVAEORRTHVTMAISWITIVVPLLTFEVLL 

VHRLDGHNTFSYVSIFVPLWLSLLTLMATTFRJIK 

GGNHWWFAIRRDF/CQDQLPQPTGKPPPPPLTDH 

HGEKALPLQNKDRGSWPASRGSPRLL 


3694 


A 


483 


761 


PRSLIDYKSYMDTKLLVARFLEQSSCTMTPDIHE 
LVENIKSVLKSDEEHMEEAITSASFLEQIMAHSX 
OHIRAHKLPXETAGLXTSELRXLTP 


3695 


A 


483 


761 


PRSLIDYKSYMDTKLLVARFLEQSSCTMTPDIHE 
LVENIKSVLKSDEEHMEEAITSASFLEQIMAHSX 

QHIRAHKLPXETAGLXTSELRXLTP 


3696 


A 


456 


733 


LSAALWEEPILSLWSETKELTNRGKMNYPOIGPH 
RPHVKGLRVRPGPGTLSNAPKSLCPGMSNSDRGI 
H\GGEGQGPGKRAGHLGRGGGMSFL 


3697 


A 


877 


1873 


VWL*TLS*HTCALMTVCRSCLVKYLEENNTCPT 
CRIVIHQSHPLQYIGHDRTMQDIVYKLVPGLQEA 
EMRKQREFYHKLGMEVPGDIKGETCSAKQHLDS 
HRNOETKADDSSNKEAAE 


3698 


A 


1 


572 


ICQCGIPHEVVRDENSSVYAEVSRLLLATGHWKR 

LRRDNPRFNLMLGERNRLPFGRLGHEPGLVQLV 

NYYRGADKLCRKASLVKLIKTSPELAESCTWFPE 

SYVIYPTNnLKTPVAPAQNGIQPPISNSRTDEREFFL 

A S YKRKKEDGEGNV WIAKS S AG AKV WVO W* M 

TDLEEEroiPSPVGLGLESEWPL 


3699 


A 


2008 


2432 


LHCKMGALETQTHPCSQNMLRSLQKCCCKVEE 

HHLQPVQVLQTLLHSATAGTGCRRPARPPPAPPT 

PTPWRSRQSGKQSERAS*LKGRGRYGLGALGGR 

GGRALGGSRWPPPLPGETLFSGCKHRRRRRGSD 

AAPGEEAGT 


3700 


A 


33 


1318 


GYQIGMALASGPARRALAGSGQLGLGGFGAPRR 

GAYEWGVRSTRKSEPPPLDRVYEIPGLEPITFAG 

KMHFVPWLARPIFPPWDRGYKDPRFYRSPPLHE 

HPLYKDQACYIFHHRCRLLEGVKQALWLTKTKL 

lEGLPEKVLSLVDDPRNHIENQDECVLNVISHARL 

WQTTEEffKRETYCPVIVDNLIQLCKSQILKHPSL 

ARRICVONSTFSATWNRESLLLOVRGSGGARLST 

KDPLPTIASREEIEATKNHVLETFYPISPIIDLHECN 

lYDVKNDTGFQEGYPYPYPHTLYLLDKANLRPH 

RLQPDQLRAKMILFAFGSALAQARLLYGNDAKV 

LEOPWVOSVGTDGRVFHFLVFOLNTTDLDSNE 

GVKNLAWVDSDQLLYQHFWCLPVIKKRWVEP 

VGPVGFKPETFRKFLALYLHGAA 


3701 


A 


86 


465 


WTLCGPEAGMVGYDPKPDGRNNTKFQVAVAGS 
VSGLVTRALISPFDVIKIRFQLQHERLSRSDPSAK 
YHGILQASRQILQEEGPTAFWKGHVPAQILSIGY 
GAVQFLSFEMLTELVHRGSVYDARE 


3702 


A 


166 


814 


GFWEKTNQSSHSMDPLGAPSQFVDVDTLPSWGD 
SCQDELNSSDTTAEEFQEDTVRSPFLYNKDVNGK 
WLWKGDVALLNCTAIVNTSNESLTDKNPVSESI 
FMLAGPDLKEDLQKLKGCRTGEAQLTKGFhnLAA 
RFIlHTVGPKYKSRYRTAAESSLYSCYRlNfVLQLA 
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S£Q ID 
NO: 


Metbod 


Predicted 

beginniog 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteine, D»Aspartic Acid, 
£=<:iu(amic Acid, JF^Plienylalanioe, G^GiycinCt H»Histidine, 
iFlsolencinc, K^Lysine, LRLcuclne, IVI»Methionine, 
N»Asparagine, P^Proline, QaGlutaroine* R'Arginine, S^erine, 
T=Threoninc, V^Valine, W=Tryptophan, Y=Tyrosine, 
X'^Unlinown, *-Stop codon, /-possible nucleotide deletion, 
\spossibie nucleotide insertion 










KEQSMSSVGFCVINSAKRGYPLKDATHIALRTVR 
RFLEIHGETIEKVV 


3703 


A 


128. 


1255 


SLGPSPKSATIPCCGDTMAPEEDAGGEALGGSFW 

EAGNYRRTVQRVEDGHRLCGDLVSCFQERARIE 

KAYAQQLADWARKWRGTVEKGPQYGTLEKAW 

HAFFTAAERLSALHLEVREKLQGQDSERVRAWQ 

RGAFHRPVLGGFRESRAAEDGFRKAQKPWLKRL 

KEVEASKKSYHAARKDEKTAQTRESHAKADSA 

VSQEQLRKLQERVERCAKEAEKTKAQYEQTLAE 

LHRYTPRYMEDMEQAFETCQAAERQRLLFFKD 

MLLTLHQHLDLSSSEKFHELHRDLHQGIEAASDE 

EDLRWWRSTHGPGMAMNWPQFEEWSLDTQRTI 

SRKEKGGRSPDEVTLTSIVPTRDGTAPPPQSPGSP 

GTGQDEEWSDEESP 


3704 


A 


1 


271 


ARGEDLALATGGGPDTVTHSNMPCPNSLVYDC 
WLNIKECSVGEHTFEDLGLCPGRNQREKKRSYK 

DFLREEEKIAAQVRNSSKKKLKDSE 


3705 


A 


170 


1318 


LNWANLVIMWPREEEKEKVQDYSLGGLSPDLRI 

DVSRKKKILKAYDEDEDEDLYPDIHPPPSLPLPG 

QFTCPQCRKSFTRRSFRPNLQLANMVQIIRQMCP 

TPYRGNRSNDQGMCFKHQEALKLFCEVDKEAIC 

VVCRESRSHKQHSVLPLEEWQEYKAKLQGHVE 

PLRKHLEAVOKMKAKEERRVTELKSOMKSELA 

AVASEFGRLTRFLAEEQAGLERRLREMHEAQLG 

RAGAAASRLAEQAAQLSRLLAEAQERSQQGGLR 

LLQDIKETFNRCEEVQLQPPEVWSPDPCQPHSHD 

FLTDAIVRKMSRMFCQAARVDLTLDPDTAHPAL 

MLSPDRRGVRLAERRQEVADHPKRFSADCCVLG 

AQGFRSGRHYWEVCMGP 


3706 


A 


204 


1996 


SRERQTTWMDHNFAPAPPEMQSHGAPGPGTSFS 

HSHVLGRPIRPSRLPGGGSPLTPVLRKTIHLDTFP 

QSHIPQTSSRLGLGARTRSVPPQETGIALGASLSP 

LPTSSLVPRKLSSISLTLHQNSQARSLDRPLSHWE 

ELPTPGKKAAPHEGGRVSSPGSPPVTLVPGGRVH 

SEGPGNPGLTKSNRMLATEKPLVSSYLALPFQSR 

LAQSAPVLAEPGSLGQGHLVSVTDHMPTRASPG 

KGKPRARGIPRPRGRLQRANTTVNLTAMDTRTD 

AARHLATMATNRPSLAINLATPNTSQLDTGTEFP 

ALDIKLGTARDLSSVGTVKSGKTVNLATAGTIKP 

GTAA^TTVGTTKPGMVMDLIASEPDKLGKAM 

ATRSTAKPDMTTEGIAMDSATSDPVKPDTITATV 

GTSRLETAMALARVNRAKLGTAKNSLALDTSR 

MGTAVGSVVPVTPDPATGKTTLGSVNNLTISDV 

ATCLLMPSRSTDLALDNTNAAMDRATEPASLDL 

ATEYKGKCRNLVGDGLGCREGEVCELGDGSMK 

PMSINSNLLGYIGIDTIIEQMRKKTMKTGFDFNIM 

VVGTEGCGAAAGLVAGSTKDPISFPQ 


3707 


A 


3 


549 


SSSISRDFLGQAACASGTMLRWLRDFVLPTAACQ 

DAEQPMRYETLFQALDRNGDGVVDIGELQEGLR 

NLGIPLGQDAEEKIFTTGDVNKDGKLDFEEFMKY 

LKDHEKKMKLAFKSLDKNNDGKIEASEIVQSLQ 

TLGLTISEQQAELILQSIDVDGTMTVDWNEWRD 

YFLFNPVTDIEEIIR 


3708 


A 


1 


1866 


EFRGAGRANMLAPRGAAVLLLHLVLQRWLAAG 
AQATPQVFDLLPSSSQRLNPGALLPVLTDPALND 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nadcotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C^Cysteine, D^Aspartic Acid, 
£>=GIutamic Add, F^Phenylalanine, G^lydne, H=Histidine, 
I^Isoleucine, K^Lysine, L^Lcncine, M^Methiomoe, 
N>=Aspftragine, F»Prolinc» Q=Clutamf ne, R»Arginine, &=Scrine, 
T'^'Threonine, V=VaIine, W«Tryptoplian, Y=Tyrosine, 
X^Unknown, *=Stop codon, /^possible nucleotide deletion, 
Vapossible nucleotide insertion 










LYVISTFKLQTKSSATIFGLYSSTDNSKYFEFTVM 

GRLSKAILRYLBCNDGKVHLVVFNNLQLADGRRH 

RILLRLSNLQRGAGSLELYLDCIQVDSVHNLPRA 

FAGPSQKPETIELRTFQRKPQDFLEELKLVVRGSL 

FQVASLQDCFLQQSEPLAATGTGDFNRQFLGQM 

TQLNQLLGEVKDLLRQEVNETSFLRNTITECQAC 

GPLKFQSPTPSTVVPPASPAPPTRPPRRCDSNPCF 

RGVQCTDSRDGFQCGPCPEGYTGNGITCIDVDEC 

KYHPCYPGEHCINLSPGFRCDACPVGFTGPMVQ 

GVGISFAKSNKQVCTDIDECRNGACVPNSICVNT 

LGSYRCGPCKPGYTGDOIRGCKAERNCRNPELN 

PCSVNAQCIEERQGDVTCVCGVGWAGDGYICGK 

DVDIDSYPDEELPCSARNCKKDNCKYVPNSGQE 

DADRDGIGDACDEDADGDGILNEODNCVLIHNV 

DQKNSDKDEFGDACDNCLSVLNNDQKDTDGDG 

RGDACDDDMDGDGIKNILDNCPKFPNRDQRDK 

DGDGVGDACDSCPDVSNPNQ 


3709 


A 


144 


417 


TQAMEGLLHYINPAHAISLLSALNEERLKGQLCD 
VLLIVGDOKFRAHKNVLAASSEYFOSLFTNKENE 

SQTVFQLDFCEPDAFDNVLNYIY 


3710 


A 


245 


688 


FGMLK>JKGHSSKKDNLAVNAVALQDHILHDLQ 

LRNLSVADHSKTQVQKKENKSLKRDTKAUDTGL 

KKTTQCPKLEDSEKEYVLDPKPPPLTLAQKLGLI 

GPPPPPLSSDEWEKVKQRSLLQGDSVQPCPICKE 

EFELRPQVFSIRG 


3711 


A 


3 


773 


SLEMSSDGEPLSRMDSEDSISSTIMDVDSTISSGRS 
TPAMMNGOGSTTSSSKNIAYNCCWDOCOACFNS 
SPDLADfflRSIHVDGQRGGVFVCLWKGCKVYNT 
PSTSQSWLQRHMLTHSGDKPFKCVVGGCNASFA 
SQGGLARHVPTHFSQQNSSKVSSQPKAKEESPSK 
AGMNKRRKLKNKRRRSLARPHDFFDAQTLDAIR 
HRAICFNLSAHIESLGKGHSVVFHSTVSILLFFQIK 
YKTLQKNISTIISKSLKI 


3712 


A 


2 


344 


RATWHNAGKEREAVQLMAGAEKRVKASHSFLR 
GLFGG>rmEEACEMYTRAANMFmAKNWSAA 
GNAFCQAAKLHMQLQSKHDSATSFVDAGNAYK 
KADPQGKTARHVACYLCV 


3713 


A 


20 


974 

« 


GAAATACSSSSSSSGAPATWAAHGPGKDVASPS 

SVSLSPRRSRLLVLRCGLRRNPERPSSSPALRRLL 

LLLLLLLLLLLGFLLSPGPERGVGGGRFGRRLAL 

LWAAALGHVVSGKVMSRRAPGSRLSSGGGGGG 

TNYSRSWNDWQPRTDSASADPGNLKYSSSRDRG 

GSSSYGLQPSNSAWSRQRHDDTRVHADIQNDE 

KGGYSVNGGSGENTYGRKSLGQELRVNNVTSPE 

FTSVQHGSRALATKDMRKSQERSMSYCDESRLS 

YLLRRITRENDRDRRLATVKQLKEFIQQPENKLV 

LVKQLDILAAVHDVLNER 


3714 


A 


237 


458 


IFALKSPSYLLPCCTPEGKMDHKQLCWSHPQKSG 

QSSRSCCICSNQHGLIWKYSLNMCLQCCHQYVK 

DIGFIKL 


3715 


A 


970 


1524 


LCTLSPGISGTAGSCLTTEPGTELGTSFAQNGFYH 

EAWLFTQALKLNPQDHRLFGNRSFCHERLGQP 

AWALADAQVALTLRPGWPRGLFRLGKALMGLQ 

RFREAAAVFQETLRGGSQPDAARELRSCLLHLTL 

QGQRGGICAPPLSPGALQPLPHAELAPSGLPSLRC 
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SEQU) 
NO: 


Metbod 


Predicted 

beginning 

Ducleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=A]anine OCysteinCt D=A5partic Acid, 
£«=<;iutaniic Add, F^Phenylalanine, G=Glycine, H-Histidine, 
Msoleucine, K-Lyslne, L^Leucine, M-MetbioQine» 
^f=Aspa^agine, P^prollDe, Q^Iutamine, R=Arginine, SBSerine, ^ 
T=:Threonine, V=Valine, W~Tryptophan, Y=Tyrosine, 
X»iln known, *=Stop codon,A=pos5ible nucleotide deletion, 
V»possjble n udeotide Insertion 










PRSTALRSPGLSPLLH 


3716 


A 


85 


308 


QGLPSTMVKLGCSFSGKPGKDPGDQDGAAMDS 

VPLISPLDISQLQPPLPDQVVnCTQTEYQLSSPDQQ 

ISfYTKSR 


3717 


A 


58 


618 


GAGCTSPGLWARKAAARCLPTYPSRAQPSNVGR 

RMJRRPGLGALAAGVPAMAESVERLQQRVQELE 

RELAQERSLQVPRSGDGGGGRVRIEKMSSEVYD 

SNPYSRLMALKRMGIVSDYEKIRTFAVAIVGVGG 

VGSVTAEMLTRCGIGKLLLFDYDKVELANMNRL 

FFOPHGAGLSKVOAAGHTPEE 


3718 


A 


3 


593 


RGAGGRAGGRADGQPNMADQRQRSLSTSGESL 

YHVLGLDKNATSDDIKKSYRKLALKYHPDKNPD 

NPEAADKFKEINNAHAILTDATKRNIYDKyGSLG 

LYVAEQFGEENVNTYFVLSSWWAKALFVFCGLL 

TCCYCCCCLCCCFNCCCGKCKPKAPEGEETEFY 

VSPEDLEAQLQSDEREATDTPIVIQPASATEP 


3719 


A 


2 


2173 


SGGVRMGSRADGPRTSGHVTGKMAVFPWHSKN 

RNYKAEFASCRLEAVPLEFGDYHPLKPITVTESK 

TKKVNRKGSTSSTSSSSSSSWDPLSSVLDGTDPL 

SMFAATADPAALAAAMDSSRRKRDRDDNSVVG 

SDFEPWTNKRGEILARYTTTEKLSINLFMGSEKG 

KAGTATLAMSEKVRTRLEELDDFEEGSQKELLN 

LTQQDYVNRIEELNQSLKDAWASDQKVKAPKN 

VHPGKLVYERIFSMCVDSRSVLPDHFSPENANDT 

AKETCLNWFFKIASIRELIPRFYVEASILKCNKFLS 

KTGISECLPRLTCMIRGIGDPL\GSVYARAYL\SRV 

GMEVAPHLKETLNKNFFDFLLTFKQIHGDTVQN 

QLWQGVELPSYLPLYPPAMDWIFQCISYHAPEA 

LLTEMMERCKKLGNNALLLNSVMSAFRAEFIAT 

RSMDFIGMDCECDESGFPKHLLFRSLGLNLALAD 

PPESDRLQILNEAWKVITKLKNPQDYINCAEVWV 

d I 1 v^ivrrr ixviviw v i>i i v Lutxxy v uvnivi i rLJi\T\r CfLJO i 

PQLQLIIKKVrAHFHDFSVLFSVEKFLPFLDMFQK 

ESVRVEVCKCI\RTPLSSINKSPPRTRSS*MPFC1V1F 

ARPCMTL/CNALTLEDEKRMLSYLINGFIKMVSF 

GRDFEQQLSFYVESRSMFCNLEPVLVQLIHSXTNR 

LAMETRKVMKGNHSRKTAAFVRSWGAYWFITIP 

SLAGIFTRLNLYLHSG 


3720 


A 


24 


296 


ENLFRAGFAFSLLRSSF YISKTYCS WFSNLISG SL 

ADFNSKGTRDYSPRQMAVRE/KVFDVIIRCFICRH 

GAEVIDTPVFELKVRNGQEETTW 


3721 


A 


2 


310 


PSCLTCVGHCSIGGSCTMIGIMMPECHCSLHMTG 
PRCEEHVFILQQPGHIASILIPLLVLLLLALVAGVV 
FWHKIOIVQGAKGFQHQRMTNGAMNVEIGKPTY 
K 


3722 


A 


75 


722 


MELVAGCYEOVLFGFAVHPEPEACGDHEOWTL 

VADFTHHAHTASLSAVAVNSRFWTGSKDETIHI 

YDMKKKIEHGALVHHSGTITCLKFYGNRHLISGA 

EDGLICIWDAKKWECLKSIKAHKGQVTFLSIHPS 

GKLALSVGTDKTLRTWNLVEGRSAFIKNIKQNA 

HIVEWSPRGEQYVVIIQNKIDIYQLDTASISGTITN 

EKRISSVKFLSES 


3723 


A 


110 


316 


MELSDNRRSGGLEGLAEKCPNLTYLNLSGNKIK 
DLSTVEALVSGTVLSLDLLFLVKFSEICLCLLISI 


3124 


A 


3 


406 


VDRGTEAWQRDPAFSGLQRVGGVDVSFVKGDS 
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SEQID 
NO: 


Method 


Predicted 

begiDDing 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^Aianine C=Cysteinc, D-Aspartic Acid, 
E>Glutamic Acid, F=Phenylatanine, G^Glycine, H^Histidine, 
I^Isoleucine, K<=Ly$ine) Lr=Leuclne, M>»Methionine, 
N"Asparagine, PsProlinc, Q=Glntaniine, R»Arginine, Serine, 
T^Threonine, V=»Vaiine, W^Tryptopban, Y=Tyrosine, 
X-Unknown, *=Stop codon, /possible nucleotide deletion, 
\-possible nucleotide insertion 










VRACASLGVLSFPELEWYEESRMVSLTAPYVSG 
FLAFREVPFLLELVQQLREKEPGLMPQVLLVDGN 
GVLHHRGFGVACHLGVLTDLPCVGVAKKLLOV 
DG 


3725 


A 


3 


406 


VDRGTEAWQRDPAFSGLQRVGGVDVSFVKGDS 

VRACASLGVLSFPELEWYEESRMVSLTAPYVSG 

FLAFREVPFLLELVQQLREKEPGLMPQVLLVDGlSf 

GVLHHRGFGVACHLGVLTDLPCVGVAKKLLOV 

DG 


3726 


A 


1 


433 


SSDDRSLFRRLKLNYAIFDEGHMLICNMGSIRYQ 
HLMTINANNRLLLTGTPVQNISILLELMSLLNFVM 
PHMFSSSTSEIRRMFSSKTKSADEOSIYEKERIAH 
AKQIIKPFILRRVKEEVLKQLPPKKDRIBLCAMSE 

KQEQLYLG 


3727 


A 


6 


383 


RIPRGKACXTVLGRSTGELEGFASSRLPPQPCGW 
GQSSDLLSRTOLDELMKKDEPPLDFPDTLEGFEY 
AFNEKGQLRHDCTGEPFVFNYREHLHRWNQKRY 
EALGEIITKYVYELLEKDCNSKKVS 


3728 


A 


3 

« 


2452 


EIAGAAAENMLGSLLCLPGSGSVLLDPCTGSTISE 

TTSEAWSVEVLPSDSEAPDLKQEERLQELESCSG 

LGSTSDDTDVREVSSRPSTPGLSVVSGISATSEDIP 

NKIEDLRSECSSDFGGKDS VTSPDMDEITHDFLY I 

LQPKQHFQHIEAEADMRIQLSSSAHQLTSPPSQSE 

SLLAMFDPLSSHEGASAWRPKVHYARPSHPPPD 

PPILEGAVGGNEARLPNFGSPMF*LPAEMEAFKQ 

RHSA^TPERLVRSRSS\DIVSSVRRPMSDPSWNRR 

PVGNEERELPPAAAIGATSLVAAPHSSSSSPSKDSS 

RGETEERKDSDDEKSDRNRPWWRKRFVSAMPK 

APIPFRKKEKQEKDKDDLGPDRFSTLTDDPSPRLS 

AQAQVAEDILDKYRNAIKRTSPSDGAMANYEST 

EVMGDGESAHDSPRDEALQNISADDLPDSASQA 

AHPQDSAFSYRDAKKKLRLALCSADSVAFPVLTV 

HSTRNGLPDHTDPEDNEIVCFLKVQIAEAINLQD 

K>JLMAQLQETMRCVCRFDNRTCRKLLASIAEDY 

RKRAPYIAYLTRCRQGLQTTQAHLERLLQRVLR 

DKEVANRYFTTVCVRLLLESKEKKIREFIQDFQK 

LTAADDKTAOVEDFLOFLYGAMAODVIWONAS 

EEQLQDAQLAIERSVMNRIFKLAFYPNQDGDILR 

DQVLHEHIQRLSKVVTANHRALQIPEVYLREAP 

WPSAQSEIRTISAYKTPRDKVQCILRMCSTIMNLL 

SLANEDSVPGADDFVPVLVFVLIKANPPCLLSTV 

QYISSFYASCLSGEESYWWMQFTAAVEFIKTIDD 

RK 


3729 


A 


3 


2452 


EIAGAAAENMLGSLLCLPGSGSVLLDPCTGSTISE 

TTSEAWSVEVLPSDSEAPDLKQEERLQELESCSG 

LGSTSDDTDVREVSSRPSTPGLSVVSGISATSEDIP 

NKJEDLRSECSSDFGGKDSVTSPDMDEITHDFLYI 

LQPKQHFQfflEAEADMRIQLSSSAHQLTSPPSQSE 

SLLAMFDPLSSHEGASAWRPKVHYARPSHPPPD 

PPILEGAVGGNEARLPNFGSPMF*LPAEMEAFKQ 

RHSA^TPERLVRSRSS\DIVSSVRRPMSDPSWNRR 

PVGNEERELPPAAAIGATSLVAAPHSSSSSPSKDSS 

RGETEERKDSDDEKSDRNRPWWRKRFVSAMPK 

APIPFRKKEKQEKDKDDLGPDRFSTLTDDPSPRLS 

AQAQVAEDILDKYKNADCRTSPSDGAMANYEST 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
oucIeoHde 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino add sequence (A'^Alanine C=Cysteiney D=A$partic Acid, 
£=<>lutaniic Acid, F^^Pbenylalanine, G=^GIycine» H^Histidlne, 
I==Isoleucine, K-Lysine, L^JLeudne, M^Methioaine, 
N"Asparagine, P^^Proline, Q^Giutamlne, R^Argininc, S^erine, 
T»Threonine, V=Valine, W=Tryptoplian, Y«Tyrosine, 
X=Unknown, *=Stop codon, ^possible nucleotide deletion, 
V^possible nucleotide insertion 










EVMGDGESAHDSPRDEALQNISADDLPDSASQA 

AHPQDSAFSYRDAKKKLRLALCSADSVAFPVLT\ 

HSTRNGLPDHTDPEDNEIVCFLKVQIAEAINLQD 

KNLN4AQLQETMRCVCRFDNRTCRKLLASIAEDY 

RKRAPYIAYLTRCRQGLQTTQAHLERLLQRVLR. 

DKEVANRYFTTVCVRLLLESKEKKIREFIQDFQK 

EEQLQDAQLAIERSVMNRIFKLAFYPNQDGDILR 

DQVLHEHIQRLSKVVTANHRALQIPEVYLREAP 

WPSAQSEIRTISAYKTPRDKVQCILRMCSTIMNLL 

SLANEDSVPGADDFVPVLVFVLIKANPPCLLSTV 

QYISSFYASCLSGEESYWWMQFTAAVEFIKTIDD 

RK 


3730 


A 


3 


2452 


EIAGAAAENMLGSLLCLPGSGSVLLDPCTGSTISE 

TTSEAWSVEVLPSDSEAPDLKQEERLQELESCSG 

LGSTSDDTDVREVSSRPSTPGLSWSGISATSEDIP 

NKIEDLRSECSSDFGGKDSVTSPDMDEITHDFLYI 

LQPKQHFQHIEAEADMRIQLSSSAHQLTSPPSQSE 

SLLAMFDPLSSHEGASAWRPKVHYARPSHPPPD 

PPILEGAVGGNEARLPNFGSPMF*LPAEMEAFKQ 

RHSATPERLVRSRSS\DIVSSVRRPMSDPSAVNRR 

PNGNEERELPPAAAIGATSLVAAPHSSSSSPSKDSS 

RGETEERKDSDDEKSDRNRPWWRKRFVSAMPK 

APIPFREOCEKQEKDKDDLGPDRFSTLTDDPSPRLS 

AQAQVAEDILDKYRNAIKRTSPSDGAMANYEST 

EVMGDGESAHDSPRDEALQNISADDLPDSASQA 

AHPQDSAFSYRDAKKKLRLALCSADSVAFPVLT\ 

HSTRNGLPDHTDPEDNEIVCFLKVQIAEAINLQD 

KNLMAQLQETMRCVCRFDNRTCRKLLASIAEDY 

RKRAPYIAYLTRCRQGLQTTQAHLERLLQRVLR 

DKEVANRYFTTVCVRLLLESKEKKIREFIQDFQK 

T TA AnnifTAOVFTlFT nPT Vr; A M AOnVTWOMA C 
Li 1 /\/\lyUlv 1 V cur LA^rLi I \JJ\^Lc\\^lJ V 1 W v^iN/\^ 

EEQLQDAQLAEERSVMNRIFKLAFYPNQDGDILR 

DQVLHEHIQRLSKVVTANHRALQPEVYLREAP 

WPSAQSEIRTISAYKTPRDKVQCrLRMCSTIMNLL 

SLANEDSVPGADDFVPVLVFVLIKANPPCLLSTV 

QYISSFYASCLSGEESYWWMQFTAAVEFJKTIDD 

RK 


3731 


A 


1 


1305 


WTAMHEAKLMEECDELVEnQQRKQMIAVKDC 

ETKVMKLRKLAQQVANCRQCLERSTVLINQAEH 

ILKENDQARFLQSAKNIAERVAMATASSQVLIPDI 

NFNDAFENFALDFSREKKLLEGLDYLTAPNPPSIR 

EELCTASHDTITVHWISDDEFSISSYELQYTIFTGQ 

ANFISLYNSVDSWMIVPNIKQNHYTVHGLQSGTR 

YTPIVKATNOAGSRNSPPTRI KTN^OPPKT nPKMT 

HKKLKISNDGLQMEECDESSLKKSHTPERFSGTGC 

YVYGVLHNSDNS*MFISLSFPLSHRYAIGIAYKSA 

PKl^WIGBCNASSWVFSRCNSNFVVRHNNKEML 

VDVPPHLKRLGVLLDYDNY/NMLSFYDPANSL\H 

LHTFDVTF\ILPVCPTFTIWNKSLM1LSGLPAPDF1 

DYPERQECNCRPQESPYVSGMKTCH 


3732 


A 


127 


2832 


LGQRLSLVPRPSLKRRLGKRLSLGLRERMMSLW 
WS/GPKVRTQATTGARPKTETKSVPAARPKTEAQ 
AMSGARPKTEVQVMGGARPKTEAQGITGARPKT 
DARAVGGARSKTDAKAIPGARPKDEAQAWAQS 
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SEQID 
NO; 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first umino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, O^Aspartic Acid, 
E^GIutamic Acid» P=FhenylaIanine, GKSlycine, H^Histidinc, 
I«IsoIeucine, K«Lysine, L^Leucine, M»Methionioe, 
N"Asparagine» P^Proline, Q^Glutamine, R«Arginine, Serine, 
T»Thrconine, V^Valine, W'=Tryptophan, Y=Tyrosine, 
X=Unknown, *«Stop codon, A^possible nucleotide deletion, 
V^possible nucleotide insertion 










EFGTEAVSQAEGVSQTNAVAWPLATAESGSVTK 

SK\ACLWIEN*SlvrWM/PETFPGTQGQKGIQPWFG 

PGEETNMGSWCYSRPRAREEASNESGFWSADET 

STASSFWTGEETSVRSWPREESNTRSRHRAKHQT 

NPRSRPR5KQEAYVDSWSGSEDEASNPFSFWVG 

ENTNNLFRPRVREEANIRSKLRTNREDCFESESED 

EFYKQSWLPGEEA>mDSGTETKKILILPWKLRA 

QKDVDSDRVKQEPRFEEEVIIGSWFWAEKEASLE 

GGASAICESEPGTEEGAIGGSAYWAEEKSSLGAV 

AREEAKPESEEEAIFGSWFWDRDEACFDLNPCPV 

YKVSDRFRDAAEELNASSRPQTWDEVTVEFKPG 

LFHGVGFRSTSPFGIPEEASEMLEAKPKNLELSPE 

GEEQESLLQPDQPSPEFTFQYDPSYRSVREIREHL 

RARESAESESWSCSCIQCELKIGSEEFEEFLLLMD 

KIRDPFIHEISKIAMGMRSASQFTRDFIRDSGVVS 

LIETLLNYPSSRVRTSFLENMIHMAPPYPNLNMIE 

TFICOVCEETLAHSVDSLEOLTGNKGCFRHLTMT 

IDYm\LIAN*YGPGFPLLF*PQAQCGETKFHVLK 

MLLNLSENPAVAKKLFSAKALSIFVGLFNIEETN 

DNIQIVIKMFQNISNIIKSGKMSLIDDDFSLEPLISA 

FREFEELAKQLQAQIDNQNDPEATGTTAFVGKG 

NNPSANRERLSPSVFCPGAQEAESLPARRVRGEE 

QRLLLEEVGARTADGIPEGW 


3733 


A 


2 


3274 


DVPLIRIEEDTGEIFTTGARIDREKLCAGIPRDEHC 

FYEVEVAILPDEIFRLVKIRFLffiDINDNAPLFPAT 

VmiSIPENSAINSICYTLPAAVDPDVGINGVQNYE 

LKSQNIFGLDVIETPGGDKMPQLIVQKELDREEK 

DTYVMKVKVEDGGFPQRSSTAILQVSVTDTNDN 

HPVFKETEDSVSIPENAPVGTSVTQLHATDADIGE 

NAKIHFSFSKLVSNIARRLFHLNATTGLITIKEPLD 

REETPNHKLLVLASDGGLMPARAMVLVNVTDV 

NDNVPSmiRYIVNPWDTVVLSENIPLNTKIALIT 

VTDKDADHNGRVTCFTDHEIPFRLRPVFSNQFLL 

ETAAYLDYESTKEYAIKLLA\ADAGKPPLNQSAM 

LFIKVKDENDNAPVFTQSFVTVSIPENNSPGIQLT 

KVSAMDADSGPNAKINYLLGPDAPPEFSLDCRT 

GMLTVVKKLDREKEDKYLFTILAKDNGVPPLTS 

NVTVFVSIIDQNDNSPVFTHNEYNFYVPENLPRH 

GTVGLITVTDPDYGDNSAVTLSILDENDDFTIDSQ 

TGVIRP>aSFDREkQESYTFYVKAEDGGRVSRSSS 

AKVTINVVDVNDNKPVFIVPPSNCSYELVLPSTN 

PGTVVFQVLAVDNDTGMNAEVRYSIVGGNTRDL 

FAIDQETGNITLMEKCDVTDLGLHRVLVKANDL 

GQPDSLFSVYIVNLFVNESVTNATLINELVPQKH 

LKIiQ*PQILEIADVSSPTSDYVKILVAAVAGTITV 

VWIFITAVVRCRQAPHLKAAQKNMQNSEWATP 

NPENRQMIMMKKKKJCKKKHSPKM.LLNVVm 

TKADDVDSDGNRVTLDLPIDLEEQTMGKYNWV 

TTPTTFKPDSPDLARHYKSASPQPAFQIQPETPLN 

LKHffllQELPLDNTFVACDSISNCSSSSSDPYSVSD 

CGYPVTTFEVPVSVHTRPPVDLEVGGAQSGQVAI 

LTSSLMELLLCLMVAAFLPLELRPLGQQNVMSW 

EQEAKDLLVGYWGDGEWCHFHFHHLEPGPVNPG 

YEKKQYHILDSDSEDTQPSGELCPIPVRPFTILSIQ 

LLQDDGEHCGTKQGFQPAVQLGLLPHKTLK 
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SEQID 
NO: 


Method 


Predicted 

begiDfliag 

nucleotide 

location 

corresponding 

to first aroioo 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C^'Cysteine, D^Aspartic Acid, 
E==Glutamic Acid, F^Phenylalanine, &=Ciycine, H^Histidlne, 
I^lsoleucine, K^Lysine, L»Leucine, M'=Methionine, 
N=Asparagine, P=Prolinc, Q=>Glut8mlne, R°Arginine, S^erine* 
T^TIireonine, V«Valine, W«Tryptophan, Y-O'yrosiDe, 
X-Unlcnown, *«Stop codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion 


3734 


A 


I 


840 


GTRPGHLPAPSDGFC V/HL* SPSWGSF*GESL/EM 

QLHSLGLQEFDIARNVLELIYAQTLVWIGIFFCPL 

LPFIOMIMLFIMFYSKMSLMMNFOPPSKAWRAS 

QMMTFFIFLLFFPSFTGVLCTLAITIWRLKPSADC 

GPFRGLPLFIHSIYSWIDTLSTRPGYLWVVWIYRN 

LIGSVHFFFILTLIVLIITYLYWQITEGRKIMIRLLH 

EQDNEGKDKMFLIEKLIKLQDMEKKANPSSLVLE 

RREVEOOGFLHLGEHDGSLDLRSRRSVOEGNPR 

A 


3735 


A 


2 


432 


VEVCRRYLWKMTVDASQNVQCCVIFSHFPFIFN 

NLSKJKLLHTDTLLKffiSKKHKAYLRSAAIEEERE 

SEFALRPTFDLTVRRNHLIEDVLNQLSQFENEDL 

RKELWVSFSGEIGYDLGGSA^KKEIFYCLFAEMIQ 

PEYGMFMY 


3736 


A 


1542 


343 


KGAPSFVJRLYQYPNFAGPHAALANKSFFKADKV 

TMLWNKKATAVLVIASTDVDKTGASYYGEQTL 

HYIATNGESAWQLPKNGPIYDWWNSSSTEFCA 

VYGFMPAKATIFNLKCDPVFDFGTGPRNAAYYS 

PHGHILVLAGFGNLILQI*AD/IMKVWNVKNYKLI 

WPVA^"n^TVFAWPPnfiPHTl TA TP A PPT P VKM 

GYKIWHYTGSILHKYDVPSNAELWQVSWQPFLD 

GIFPAKHTYQAVPSEVPNEEPKVATAYRPPALRN 

KPITNSKUffiEEPPQNMKPQSGNDKPLSKTALKN 

QRKHffiAKKAAKQEARSDKSPDLAPTPAPQSTPR 

l^SQSISGDPEEDKKDCNLKKKLKAIEQLKEQAA 

TGKQLEKNQLEKIQKETALLQELEDLELGI 


3737 


A 


3190 


664 

• 


. VAMGTPRAQHPPPPQLLFLILLSCPWIQGLPLKEE 
EILPEPGSETPTVASEALAELLHGALLRRGPEMG 
YLPGPPLGPEGGEEETTTTirrmV 1 TTVTSPVLC 
NNNISEGEGYVESPDLGSPVSRTLGLLDCTYSIHV 
YPGYGIEIQVQTLNLSQEEELLVLAGGGSPGLAP 
RLLANSSMLGEGQVLRSPTNRLLLHFQSPRVPRG 
GGFRIHYQAYLLSCGFPPRPAHGDVSVTDLHPGG 
TATFHCDSGYQLQGEETLICLNGTRPSWNGETPS 
CMASCGGTIHNATLGRIVSPEPGGAVGPNLTCR 
WVIEAAEGRRLHLHFERVSLDEDNDRLMVRSGG 
SPLSPVIYDSDMDDVPERGLISDAQSLYVELLSET 
PANPLLLSLRFEAFEEDRCFAPFLAHGNVTTTDPE 
YRPGALATFSCLPGYALEPPGPPNAIECVDPTEPH 
WNDTEPACKAMCGGELSEPAGVVLSPDWPQSY 
SPGQDCVWGVHVQEEKRILLQVEILNVREGDML 
TLFDGDGPSARVLAQLRGPQPRRRLLSSGPDLTL 
QFQAPPGPPNPGLGQGFVLHFKEVPRNDTCPELP 
PPEWGWRTASHGDLIRGTVLTYQCEPGYELLGS 
DILTCOWDLSWSAAPPACOICIMTCADPGEIANG 
HRTASDAGFPVGSHVQYRCLPGYSLEGAAMLTC 
YSRDTGTPKWSDRVPKCALKYEPCLNPGVPENG 
YQTLYKHHYQAGESLRFFCYEGFELIGEVTITCV 
PGHPSQWTSQPPLCKVTQTTDPSRQLEGGNLAL 
ADLLPLGLVIVLGSGVYIYYTKLQGKSLFGFSGSH 
SYSPITVESDFSNPLYEAGDTREYEVSI 


3738 


A 


3190 


664 


VAMGTPRAQHPPPPQLLFLILLSCPWIQGLPLKEE 
EILPEPGSETPTVASEALAELLHGALLRRGPEMQ 
YLPGPPLGPEGGEEETTTTUTTTTNTITrVTSPVLC 
NNNISEGEGYVESPDLGSPVSRTLGLLDCTYSIHV 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

locatioii 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide , 
sequence 


Amino acid sequence (A=Alanine C=Oysteine, D='Aspartic Acid, 
E==GIutaniic Add, F^Phenylalanine, G=Giycine, H^Histidine, 
I»Isoleucine, K=L>sine, L^^Leucine, M»Metiiionine, 
N=>Asparagine, P«ProUne, Q<=G]utanifne, R^Arginine, S=Serine, 
T»Tlireonine, V«Valine, W-Tryptophan, Y»Tyrosine, 
X-Unlcnown, *«Stop codon, /^possible nucleotide deletion, 
\~po8Sible nucleotide insertion 










YPGYGIEIQVQTLNLSQEEELLVLAGGGSPGLAP 

RLLANSSMLGEGQVLRSPTNRLLLHFQSPRVPRG 

GGFRIHYQAYLLSCGFPPRPAHGDVSVTDLHPGG 

TATFHCDSGYQLQGEETLICLNGTRPSWNGETPS 

CMASCGGTIHNATLGRIVSPEPGGAVGPNLTCR 

WVIEAAEGRRLHLHFERVSLDEDNDRLMVRSGG 

SPLSPVIYDSDMDDVPERGLISDAQSLYVELLSET 

PANPLLLSLRFEAFEEDRCFAPFLAHGNVTTTDPE 

YRPGALATFSCLPGYALEPPGPPNAIECVDPTEPH 

WNDTEPACKAMCGGELSEPAGVVLSPDWPQSY 

SPGQDCVWGVHVQEEKRILLQVEILNVREGDML 

TLFDGDGPSARVLAQLRGPQPRRRLLSSGPDLTL 

QFQAPPGPPNPGLGQGFVLHFKEVPRNDTCPELP 

PPEWGWRTASHGDLIRGTVLTYQCEPGYELLGS 

HRTASDAGFPVGSHVQYRCLPGYSLEGAAMLTC 
YSRDTGTPKWSDRVPKCALKYEPCLNPGVPENG 
YOTLYKHHYOAGESLRFFCYFGFFLTGEVTTTrV 

PGHPSQWTSQPPLCKVTQTTDPSRQLEGGNLAL 

AILLPLGLVIVLGSGVYIYYTKLQGKSLFGFSGSH 

SYSPITVESDFSOTLYEAGDTREYEVSI 


3739 


A 


734 


445 


LLEPEPAEEYTEQSEVEST/EGMILI*CCLYFAAFQ 
TNVSNIYFALQYYNRQFMAETQFTSGEKEQVDE 
WTVETVEVRVLCIAKLLSLSSVSNFYLY 


3740 


A 


2 


1578 


MAHYITFLCMVLVLLLQNSVLAEDGEVRSSCRT 

APTDLVFILDGSYSVGPENFEIVKKWLVNITKNF 

DIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTA 

AVESILYLGGNTKTGKAIQFALDYLFAKSSRFLT 

KIAWLTDGKSQDDVKDAAQAARDSKITLFAIG 

VGSETEDAELRAIANKPSSTYVFYVEDYIAISKIR 

EVMKQKLCEESVCPTRIPVAARDERGFDILLGLD 

VNKKVKKRIQLSPKKIKGYEVTSKVDLSELTSNV 

FPEGLPPSYVFVSTQRFKVKKIWDLWRILTIDG/* 

KTLFDEGWHQIRLLVTEQDVTLYIDDQQIENKPL 

HPVLGILINGQTQIGKYSGKEETVQFDVQKLRIY 

CDPEQNNRETACEIPGFCLNGPSDVGSTPAPCICP 

PGKPGLQGPKGDPGLPGNPGYPGQPGQDGKPVS 

TESLVISGISGITGYQGIAGTPGVPGSPGIQGARGL- 

PGYKGEPGRDGDK 


3741 


A 

4 


5048 


1236 


MSAPAGSSHPAASARIPPKFGGSAVSGAAAPAGP 

GAGPAPHQQNGPAQNQMQVPSGYGLHHQNYIA 

PSGHYSQGPGKMTSLPLDTQCGDYYSALYTVPT 

QNVTPNTVNQQPGAQQLYSRGPPAPHIVGSTLGS 

FQGAASSASHLHTSASQPYSSFVNHYNSPAMYS 

ASSSVASQGFPSTCGHYAMSTVSNAAYPSVSYPS 

LPAGDTYGQMFTSQNAPTVRPVKDNSFSGQNTA 

ISHPSPLPPLPSQQHHQQQSLSGYSTLTWSSPGLP 

STODNLIRNHTGSLAVANNNPTITVADSLSCPVM 

QNVQPPKSSPWSTVLSGSSGSSSTRTPPTANHPV 

EPVTSVTQPSELLQQKGVQYGEYVNNQASSAPT 

PLSSTSDDEEEEEEDEEAGAn^SSSTTSSASPMPNS 

YDALEGGSYPDMLSSSASSPAPDPAPEPDPASAP 

APASAPAPWPQPSKMAKPLAMAIQHFSLVIRNCL 

QHHLFLEYSPSNPVYSGFQQYPQQYPGVNQLSSS 
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1 SEQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteine, D»Aspartic Acid, 
E=Glutaiiiic Acid, F^Phenylalanine, &=Glycine, H<=Histidine, 
I^Isoleuclne, K»Lysine, L=^Leucine, M-Methionine, 
N°Asparagine, P=»Proline, Q^GIutamine, R=Atginine, SaSerine, 
T»Tbreonine, V=VaIlne, W«Tryptophan, Y-Tyrosine, 
X=Unlino>vn, *»Stop codon, ^possible nucleotide deletion, 
\ppossible nucleotide insertion 










IGGLSLQSSPQPESLRPVNLTQERNELPMTPVWAP 

VPNLNADLKKLNCSPDSFRCTLTNIPQTQALLNK 

AKLPLGLLLHPFRDLTQLPVITSNTIVRCRSCRTYI 

NP\FVSFIDQRR*KCNLCYRVNDVPEEFMYNPLT 

RSYGEPHKRPEVQNS\TVEFIASSDYMLRPPQPAV 

YLFVLDVSHNAVEAGYLTI/LWCQSLLE\NLDKLP 

G\DSRT\RIGFMTFD\STYSFLQFTQEGLSQPQMLI 

VSDIDDVFLPTPDSLLVNLYESKELIKDLLNALPN 

MFTNTRETHSALGPALQAAFKLMSPTGGRVSVF 

QTQLPSLGAGLLQSREDPNQRSSTKVVQHLGPAT 

DFYKKLALDCSGQQTAVDLFLLSSQYSDLASLA 

CMSKYSAGCIYYYPSFHYTHNPSQAEKLQKDLK 

RYLTRKIGFEAVMRIRCTKGLSMHTFHGNFFVRS 

TDLLSLANINPDAGFAVQLSIEESLTDTSLVCFQT 

ALLYTSSKGERRIRVHTLCLPVVSSLSDVYAGVD 

VQAAICLLANMAVDRSVSSSLSDARDALVNAW 

LKQKAFRTGTSTRLDDRVYAMCQIKSQPLVHLM 

KMIHPNLYRIDRLTDEGAVHVNDRIVPQPPLQKL 

SAEBaTREGAFLMDCGSVFYIWVGKGCDNNFIE 

DVLGYTOFASIPQKMTHLPELDTLSSERARSFIT 

WLRDSRPLSPILHIVKDESPAKAEFFQHLffiDRTE 

AAFSYYEFLLHVQQQICK 


3742 


A 


934 


68 


SMLASQGVLLHPYGVPMIVPAAPYLPGLIQGNQE 
AAAAPDTMAQPYASAQFAPPQNGIPAEYTAPHP 

rlr AJril 1 i VJV^ 1 i VJriinlJjINJj i JrJr/\^^ inoriV^o 

SAQTVSGTRNKQD*RSTDGWPSPKTQTS*KHGk 

QVSSPSGLHVSNIPFR\FRDPDLRQMF\GQFGKILD 

VEIIFNERGSKGFGFVTFENSADADRAREK\LHGT 

VV\EGRKI\EV>nNATARV]VmsrKKTVNPYTNGWK 

LNPVVGAVYSPEFYAGTVLLCQANQEGSSMYSA 

PSTDFRGAKLHTSRPLLSGS 


3743 


A 


3 


1456 


QFQQAWMQNKVPIPAPNEVLNDRKEDIKLEEKK 

KTQAEIEQEMATLQYTNPQLLEQLKIERLAQKQV 

EQIQPPPSSGTPLLGPQPFPGQGPMSQIPQGF/PTA 

PSISADANEHGSXKGPPGPQGQFRPPGPQGQMGP 

QGPPLHQGGGGPQGFMGPQGPQGPPQGLPRPQD 

MHGPQGMQRHPGPHGPLGPQGPPGPQGSSGPQG 

HMGPQGPPGPQGHIGPQGPPGPQGHLGPQGPPGT 

QGMQGPPGPRGMQGPPHPHGIQGGPGSQGIQGP 

MGHPPQEMRGPHPPGGLLGHGPQEMRGPQEIRG 

MQGPPPQGSMLGPPQELRGPPGSQSQQGPPQGSL 

GPPPQGGMQGPPGPQGQQNPARGPHPSQGPIPFQ 

QQKTPLLGDGPRAPFNQEGQSTGPPPLIPGLGQQ 

GAQGRIPPLNPGQGPGPNKVS/ERGAPPRHEGRA 

PPRGRDGFPGPMKTLV 


3744 


A 


1571 


652 


PLTGRKCPGWTHSGSRRSPRIAEEVPGFPKRAEA 
SROFSETADRLELLRRAVMAAARATTPADGEEP 

apeaealaaarerssrflsglelvkqgaearvfr 

grfqgrXavikhrfpkgyrhpalearlgrrrtv 

qearallrcrragisapvvffvdyasnclymeei 

egsvtvrdufsplwrlkktpqglsnlaktigqvl 

armhdedlihgdlttsnmllkppleqlnivlidf 

GLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFE 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresnondi Off 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C^^Cystelne^ D^Aspartic Acid, 
E^GIutainic Acid, F^Phenylalanine, G"Glycine, II=Histidine, 
I-Isoleucine, KpLysine, L^Leucine, M-Methionine, 
N^Asnflrflffiiip PsProline. O^GIiitanine. R=Ar0inin«» ^o^Qprinp 

T»Threonine, V^Vallne, W»Tryptoplian, Y^Tyrosine, 
X=Unknown, *«Stop codon, /^possible nucleotide deletion, 
\=possible nudeotide insertion 










AFLKSYSTSSKKARPVLKKLDEVRLRGKKRSMV 
G 


3745 


A 


127 


1433 


GSHRFSLASPLDPEVGPYCDTPTMRTLFNLLWLA 

LACSPVHTTLSKSDAKKAASKTLLEKSQFSDKPV 

QDRGLWTDLKAESWLEHRSYCSAKARDRHFA 

GDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWL 

QLKRRGREMFEVTGLHDVDQGWMRAVRKHAK 

GL\P*CLGSCLRTGLTMISGA^LDSEDEIEELSKT 

WQVAKNQHFDGFVVEVWNQLLSQKRVGLIHM 

LTHLAEALHOARLLALLVIPPATTPfiTnOT HMFT 

HKEFEQLAPVLDGFSLMTYDYSTAHQPGPNAPL 

SWVRACVQVLDPKSKWRSKBLLGLNFYGMDYA 

TSKDAREPVVGARYIQTLKDHRPRMVWDSQVSE 

HFFEYKKSRSGRHWFYPTLKSLQVRLELARELG 

VGVSIWELGQGLDYFYDLL*VGIAASAVDVFFSK 

PWSE 


3746 


A 


1 


898 


IDRAAECRTKPLPMAVSIRGNADSIVACLYLMVL 
YLIKKRLVACAAVFYGFAVHMKIYPETYILPITL 

ALMFVAVAGLTFFALSFGFYYEYGWEFLEHTYF 

YHLTRRDIRHNFSPYFYMLYLTAESKWSFSLGIA 

AFLPQLILLSAVSFAYYRDLVFCWFLHTSIFVTFN 

KVCTSQYFLWYLCLLPLVMPLVRMPWKRAWL 

LNELWFIGQAMWLAPAYVLEFQGBCNTFLFIWLA 

GLFFLLINCSILIQIISHYKEEPLTERIKYD 


3747 


A 


1 


2325 

• 


MVISFQGLVTFGDVAVDFSQEEWEWLNPIQRNL 

YRKVMLENYRNLASLGLCVSKPDVISSLEQGKEP 

WTVKRKMTRAWCPDLKAVWKIKELPLKKDFCE 

GKLSQAVITERLTSYNLEYSLLGEHWDYDALFET 

QPGLVTIKNLAVDFRQQLHPAQKNFCKNGIWEN 

NSDLGSAGHCVAKPDLVSLLEQEKEPWMVKREL 

TGSLFSGQRSVHETQELFPKQDSYAEGVTDRTSN 

TKLDCSSFRENWDSDYVFGRKLAVGQETQFRQE 

PITHNKTLSKERERTYNKSGRWFYLDDSEEKVH 

NRDSIKNFQKSSVVIKQTGIYAGKBCLFKCNECKK 

TFTQSSSLTVHQRIHTGEKPYKCNECGKAFSDGS 

SFARHQRCHTGKKPYECIECGKAFIQNTSLIRHW 

RYYHTGEKPFDCIDCGKAFSDfflGLNQHRRIHTG 

EKPYKCDVCHKSF\RYGSSLTVHQRIHTGEKPYE 

CDVCRKAFSHHASL-nQVHQRVHSGEKPFKCKEC 

GKAFRQNIHLASHLRIHTGEKPFECAECGKSFSIS 

SQLATHQRIHTGEKPYECKVCSKAFTQKAHLAQ 

HOKTHTGEKPYECKECGKAFSOTTHLIOHORVH 

TGEKPYKCMECGKAFGDNSSCTQHQRLHTGQRP 

YECIECGKAFKTKSSLICHRRSHTGEKPYECSVC 

GKAFSHRQSLSVHQRIHSGKKPYECKECRKTngi 

GHLNQHKRVHTGERSYNYKKSRKVFRQTAHLA 

HHQRIHTGESSTCPSLPSTSNPVDLFPKFLWNPSS 

LPSP 


3748 


A 


823 


1 


GGYTKSGYDSACKDFVPHDLEVQIPGRVFLVTG 

GNSGIGKATALEIAKRGGTVHLVCRDQAPAEDA 

RGEIIRE\SGNQNIFLHIVDLSDPKKIWKFVENFKQ 

EHKLHVLVVIWAGCMVNKREAHKJCMDFEIO^ 

CQYSGVCTFLTTRPDPLCWRKNTDPRVmvSSG 

GMLVQKLNNQ*SPVRKNTIWMGTMVYAQNKVS 
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SEQIB 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=>Alaninc C=Cysteinc, D=Aspartic Add, 
£=Olutamic Acid, F^^Ptienylalanine, G«Glycine, H^Histidine, 
I=lsoleuciQe, K=Lysine, L^Leucine» M— Methionine, 
N»AsparagIne, F=>Proiine, Q»Gluf amine, R^Arginine, S<=Serine, 
T^Tlireonlne, V«Vallne, W-Tryptoplian, Y-Tyrosine, 
X-Uttknown, *«^top codon, ^=pos5ible nucleotide deletion, 
Npposslble nucleotide insertion 










ERQQVVLT\ERWGPRAPG\IHFSS1VEHPGWA\DTPG 
VRQAMPGFHVQASGYRLRSEAQGADTMLWLAL 
SSARSRTAQRP 


3749 


A 


1939 


715 


GFLRLSQAT\RQRLSIPVMVLTLDPTRD\QCFGDR 

FSRLLLDEFLGYDDIL\MSSVfCGLAENEENKGFLR 

NVVSGEHYRFVXSMWMAR-nSYLAAFANHGQSF 

TLSVSHACCGYSHHQIFVFIVDLLQMLEMNMAIA 

FPAAPLLTVILALVGMEAJMSEFFNDTTTAFYIILI 

VWLADQYDAICCHTSTSKRHWLRFFYLYHFAFY 

AYHYRFNGOYSSLALVTSWLFIOHSMIYFFHHYE 

LPAILQHVRIQ\EMLLQAPTLGPGTPTA\LPDDMN 

NNSGAPATAP\DSAGQPPALGPVSPGASGSPGPV 

AAAPSSLVAAAASVAAAAGGDLGWMAETAAIIT 

DASFLSGLSASLLERRPASPLGPAGGLPHAPQDS 

VPPSDSAASDTTPLGAAVGGPSPASMAPTEAPSE 

VGS 


3750 


A 


2 


844 

• 


GLLEPFSKLLSFVIQNAVFTLAYLVELCGLCYRA 
FTKERDKFYLSRSVVLELLQALKLKSPLPDTNLL 
LLVOnCADAGTKLAESTILSKOMIASVPGrnTA 

AMECVRQYIhreVLDFM\ADMHTLTKLKSHMKTC 

SQPLHEDTFGGHLKVGLAQIAAMDISRGNHRDN 

KAVIRYLPWLYHPPSAMQQGPKEFIECVSHIRLL 

SWLLLGSLTHNAVC/LKWPPLPGLPIPLDAGSHV 

ADHLIVILIGFPEOSKTSVL\HMCSLFHAF\SLAOL 

WDSLLARQSGRW 


3751 


A 


431 


2 


AFTRKCEETAFIVPQCEIIPTE/WVCRRIPTGSSLER 

NPGVKEGCEFCPPKVEMFFKDDANHDPQWSRQ 

QLIAAKFGFAALGI/QTEVDIMSHAT*AVFEIPEKS 

RLVPQNCTPVDMKIEFGVHVTSKEILTDVIDNDS* 

RHSPS 


3752 

• 


A 


131 


1278 


AWSGSGLLVLCINTASMPMISVLGKMFLWQREG 

PGGRWTCQTSRRVSSDPAWAVEWELPRGLSLSS 

LGSARTLRGWSRSSRPSSVDSQDLPEVNVGDTV 

AMLPKSRRALTIQEIAALARSSLHGISQWKDHV 

TKPTAMAQGRVAHLIEWKGWSKPSDSPAALESA 

FSSYSDLSEGEOEARFAAGVAEOFAIAEAKLRA 

WSSVDGEDSTDDSYDEDFAGGMDTDMAGQLPL 

GPHLQDLFTGHRFSRPVRQGSVEPESDCSQTVSP 

DTLCSSLCSLEDGLLGSPARLA\PSCWAMSCFSPN 

CPPAGKVPSAAW/APLEAQDSLYNSPLTESCLSP 

AEEEPAPCKDCQPLCPPLTGSWERQRQASDLASS 

GVVSLDEDEAEPEEQ 


3753 


A 


3 


1138 


YYSSVRQRVTCEEPRFRECAAALIEGSATEVYAG 

EWRADRRSGFGVSQRSNGLRYEGEWLGNRRHG 

YGRTTRPDGSREEGKYKRNRLVHGGRVRSLLPL 

ALRRGKVKEKVDRAVEGARRAVSAARQRQEIA 

AARAADALIiCAVAASSVAEKAVEAARMAKLIA 

ODLOPMLEAPGRRPRODSEGSDTEPLDEDSPGV 

YENGLTPSEGSPELPSSPASSRQPWRPPACRSPLP 

PGGDQGPFSSPKAWPEEWGGAGAQAEELAGYE 

AEDEAGMQGPGPRDGSPLLGGCSDSSGSLREEE 

GEDEEPLPPLRAPAGTEPEPIAMLVLRGSSSRGPD 

AGCLTEELGEPAATERPAQPGAANPLVVGAVAL 

LDLSLAFLFSQLLT 


3754 


A 


2 


3338 


SSLLEKMTSSDKJDFRFMATSDLMSELQKDSIQLD 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^Alaninc C=Cysteinc, D»Aspartic Acid, 
E'=Giutamh Acid, F=PhcByIalanine, G=Glycine, H^Histidioe, 
I-Isoieucine, K=Lysine, L=Lcucine, M^Mettiionine, 
N^Asparaglne, P-Proilne, Q=<:intamine, R='Argininc, SaScrine, 
l^Threonine, V^Vaiine, W«Tryptophan, Y^Tyrosine, 
X^Unlinoivn, *=Stop codon, ^"possible nucleotide deletion* 
V^possiblc nucleotide insertion 




• 






EDSERKVVKMLLRLLEDKNGEVQKLAVKWLGV 

PLGAFHASLLHCLLPQLSSPRLAVRKRAVGALGH 

LATACSTDLFVELADHLLDRLPGPRVPTSPTAIRT 

LIQCLGSVGRQAGHRLGAHLDRLVPLVEDFCNL 

DDDELRESCLQAFEAFLRKCPKEMGPHVPNVTS 

LCLQYIKHDPNYNYDSDEDEEQMETEDSEFSEQE 

SEDEYSDDDDMSWKVRRAAAKCIAALISSRPDL 

LPDFHCTLAPVLERRFKEREENVKADVFTAYIVL 

LRQTRPPKGWLEAMEEPTQTGSNLHMLRGQVPL 

WKALQRQLKDRSVRARQGCFSLLTELAGVLPG 

SLAEHMPVLVSGIIFSLADRSSSSTIRMDALAFLQ 

GLLGTEPAEAFHPHLPILLPPVMACVADSFYKIA 

AEALWLQELVRALWPLHRPRMLDPEPYVGEMS 

AVTLARLRATDLDQEVKERAISCMGHLVGHLGD 

RLGDDLEPTLLLLLDRLRNEITRLPAIKALTLVAV 

SPLQLDLQPaAEALHILASFLRKNQRALRLATLA 

ALDALAQSQGLSLPPSAVQAVLAELPALVNESD 

MHVAQLAVDFLATVTQAQPASLVEVSGPVLSEL 

LRLLRSPLLPAGVLAAAEGFLQALVGTRPPCVDY 

AKLISLLTAPVYEQAVDGGPGLHKQVFHSLARC 

VAALSAACPQVEAESTASRLVCDARSPHSSTGVK 

VLAFLSLAEVGQVAGPGHERELKAVLLEALGSPS 

EDVRAAASYALGRVGAGSLPDFLPFLLEQIEAEP 

RRQYLLLHSLKEALGAAQPDSLKPYAEDIWALL 

FQRCEGAEEGTRGWAECIGKLVLVNPSFLLPRL 

SFIAVHNKPSLVRDLLDDILPLLYQETKIRRDLIRE 
VEMGPFKHTVDDGLDVRKAAFECMYSLLESCLG 
QLDICEFLNHVEDGLKDHYDIRMLTFIMVARLAT 
LCPAPVLQRVDRLIEPLRATCTAKVKAGSVKQEF 
EKQDELKRSAMRAVAALLTIPEVGKSPIMADFSS 
QIRSNPELAALFESIQKDSTSAPSTDSMELS 


3755 


A 


2 


3338 


SSLLEKMTSSDKDFRFMATSDLMSELQKDSIQLD 

EDSERKVVKJV1LLRLLEDKNGEVQNLAVKWLGV 

PLGAFHASLLHCLLPQLSSPRLAVRKRAVGALGH 

LATACSTDLFVELADHLLDRLPGPRVPTSPTAIRT 

LIQCLGSVGRQAGHRLGAHLDRLVPLVEDFCNL 

DDDELRESCLQAFEAFLRKCPKEMGPHVPNVTS 

LCLQYBCHDPNYNYDSDEDEEQMETEDSEFSEQE 

SEDEYSDDDDMSWKVRRAAAKCIAALISSRPDL 

LPDFHCTLAPVLERRFKEREENVKADVFTAYIVL 

LRQTRPPKGWLEAMEEPTQTGSNLHMLRGQVPL 

VVKALQRQLKDRSVRARQGCFSLLTELAGVLPG 

SLAEHMPVLVSGIIFSLADRSSSSTIRMDALAFLQ 

GLLGTEPAEAFHPHLPILLPPVMACVADSFYKIA 

AEALWLQELVRALWPLHRPRMLDPEPYVGEMS 

AVTLARLRAIDLDQEVKERAISCMGHLVGHLGD 

RLGDDLEPTLLLLLDRLRNEITRLPAIKALTLVAV 

SPLQLDLQPILAEALHILASFLRKNORALRLATLA 

ALDALAQSQGLSLPPSAVQAVLAELPALVNESD 

MHVAQLAVDFLATVTQAQPASLVEVSGPVLSEL 

LRLLRSPLLPAGVLAAAEGFLQALVGTRPPCVDY 

AKLISLLTAPVYEQAVDGGPGLHKQVFHSLARC 

VAALSAACPQ\EAESTASRLVCDARSPHSSTGVK 

VLAFLSLAEVGQVAGPGHERELKAVLLEALGSPS 
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SEQTD 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

coirespondlDg 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanioe C=Cysteinc, O^Aspartic Acid, 
E>Clutanilc Acid, F<=Plienylaianine, G^GIycine, H»Histidine, 
I»IsoIeucine, K«Ly8ine» L"Lcucine, M^Metliioninc, 
N=Asparagine, P«»Proline, Q-Glutamine, R"Argiainei S=Serine, 
T°Threoninc, V^Vallne, W=Tryptophan, Y»Tyrosine, 
X^linlcnown, *=Stop codon, possible nucleotide deletion, 
>»possible nucleotide Insertion 










EDVRAAASYALGRVGAGSLPDFLPFLLEQIEAEP 
RRQYLLLHSLKEALGAAQPDSLKPYAEDIWALL 
FQRCEGAEEGTRGVVAECIGKLVLVNPSFLLPRL 
RKOLAAGRPHTRSTVITAVKFLISDOPHPIDPI T K 

SFIAVHNKPSLVRDLLDDILPLLYQETKIRRDLIRE 

VEMGPFKHTVDDGLDVRKAAFECMYSLLESCLG 

QLDICEFLNHVEDGLKDHYDIRMLTFIMVARLAT 

LCPAPVLQRVDRLIEPLRATCTAKVKAGSVKQEF 

EK.QDELKRSAMRAVAALLTIPEVGKSPIMADFSS 

QIRSNPELAALFESIQKDSTSAPSTDSMELS 


3756 

• 


A 


112 


1361 

• 


SLEEQQGRHPSFAPKCASQILGRIMrrLITEQLQK 

QTLDELKCTRFSISLPLPDHADISNCGNSFQLVSE 

GASWRGLPHCSCAEFQ/DQPQLQLPSLRPEPAPQ 

TRHRGNSPKEQPFSQVLRPEPPDPEKLPVPPAPPS 

KRHCRSLSVPVDLSRWQPVWRPAPSKLWTPIKH 

RGSGGGGGPQVPHQSPPKRVSSL/SVPPSSQCLFS 

ESDAESLSPCPPQRRFSLSPSLGPQASRFLPSARSS 

PASSPELPWRPRGLRNLPRSRSQPCDLDARKTGV 

KRRHEEDPRRLRPSLDFDKMNQKPYSGGLCLQE 

TAREGSSISPPWFMACSPPPLSASCSPTGGSSQVL 
SESEEEEEGAVRWGROAT WRTI rORnFnnT DT 

NLflSEN 


3757 


A 


413 


1 


PKPMLQQDFT/SLPDQGLDHIAE/NSYFDARSLCA 
AELVCKEWQQVTSE*MLWKKLIERMVHAYPLW 
KGLSEKVW/DQHLFKNRPTDGPPNSFHRSLYPKri 
QV1ETIESNWQCG*HTLQRIQCHSEKSKGVYCLQ 
YDDEK 


3758 


A 


2 


613 


FVSfiSPWRMDOSTFRT FARPPAfrRT PW<;<3R nPM 

r V OvJOI^ W IVIVII^vJlj l liJ\JUJu/VI\J\J^/\VJ A-Ur^ W OoXvs^CiVl 

TRRPSLMAGRQHGWSAQQSATVANPVPGANPD 

LLPHFLGEPEDVYIVKNKPVLLVCKAVPATQIFF 

KCNGEWVRQVDHVIERSTDGSSGLPTMEVRINV 

SRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKA 

YIRIAYLRKNFEQEPLAKEVSLEQGIVLPCRPPEGI 

PPAE 

Ami »Jl^ 


3759 


A 


1 


561 


ADDTLHLWNLRQKRPAILHSLKFCRERVTFCHLP 

FQSKWLYVGTERG>]1HIVNVESFTLSGYVIMWN 

KAIELSSKSHPGPWHISDl^PMDEGKLLIGFESGT 

VVLWDLKSKKADYRYTYDEAIHSVAWHHEGKQ 

FICSHSDGTLTIWNVRSPAKPVQTITPHGKQLKD 

GKKPEPCKPILKVEFXTTR 


3760 


A 

• 


1 


824 


LPACRCGCVAGCPSNHGICRCLRASERQVCVMH 
LKHLRTLLSPQDGAAKVTCMAWSQNNAKFAVC 
TVDRWLLYDEHGERRDKFSTKPADMKYGRJCS 

YMVKGMAFSPDSTKIAIGQTDNIIYVYKIGEDWG 

DKKVICNKFIQTVKFRPVPGTLG*TNrYQYIYL*IQ 

PGVAFLTSECDFSYCKDGASWLFMVICCLP*SPA 

VSFPIGD*\SAVTCLQWPAEYnVFGLAEGKVRLS 

NTKTNKSSTIYGTESYVVSLTTNCSGKGILSGHA 

DGYQR 


3761 


A 

« 


2253 


320 


PVIQRCSQPYGFSLLISFFLKCVSETSQQPPSRKVF 

QLLPSFPTLTRSKSHESQLGNRIDDVSSMRFDLSH 

GSPQMVRRDIGLSVTHRFSTKSWLSQVCHVCQK 

SMIFGVKCKHCRLKCHNKCTKEAPACRISFLPLT 

RLRRTESVPSDINOTVDRAAEPHFGTLPKALTKK 
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SEQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=<^ysteine, D»Aspartic Acid, 
£"Glutamic Acid, F»Pbenylalanine, G^Iycine, HaHistidine, 
lafsoleacine, K-Lysine, L^Leucine, M=MethioniDe, 
N=Asparagine, P^Proline, Q^lutamine, R'eArginine, S^erine, 
T'^Threonine, V=Valine, W=Tryptoplian, V=Tyroslne, 
XsUnliDown, *'^top codon, /^possible nucleotide deletion, 
V^possible nucleotide insertion 










EHPPAMNHLDSSSNPSSTTFSTPSSPAPFPTSSNPS 

SATTPP\NPSP\GQR\DSRFNFPSC/AYFIHHR\Q\QFI 

FPDISAFAHAAPLPEAADGTRLDDQPKADVLEAH 

EAEAEEPEAGKSEAEDDEDEVDDLPSSRRPWRG 

PISRKASQTSVYLQEWDIPFEQVELGEPIGQGRW 

GRVHRGRWHGEVAIRLLEMDGHNQDHLKLFKK 

EVMNYRQTRHENVVLFMGACMNPPHLAIITSFC 

KGRTLHSFVRDPKTSLDINKTRQIAQEIIKGMGYL 

GWP\EGRRENQLKLSHDWLCYLAPEIVREMTPG 

KDEDQLPFSKAADVYAFGTVWYELQARDWPLK 

NQAAEASIWQIGSGEGMKRVLTSVSLGKEVSEN 

LSACWAFDLQERPSVFSLLMDMLEKLPKLNRRLS 

HPGHF*KSADINSSKWPRFERFGLGVLESSNPK 

M 


3762 


A 


2 


1578 


MAHYITFLCMVLVLLLQNSVLAEDGEVRSSCRT 

APTDLVFILDGSYSVGPENFEIVKKWLVNITKNF 

DIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTA 

AVESILYLGGNTKTGKAIQFALDVLFAKSSRFLT 

KIAVVLTDGKSQDDVKDAAQAAKDSKITLFAIG 

VGSETEDAELRAIANKPSSTYVFYVEDYIAISKIR 

EVMKQKLCEESVCPTRIPVAARDERGFDILLGLD 

VNKKVKKRIQLSPKKIKGYEVTSKVDLSELTSNV 

FPEGLPPSYVFVSTQRFKVKKIWDLWRILTDDG/* 

POTA VTT TsiriVniflT T FTTTQVTMnQnWTPA'MPrkA/ 

r\^Lt\ V 1 lirfiN\J Y i^XVLi_rl^F 1 I 1 O V ilNUol^ V V 1 rAxSryJ V 

KTLFDEGWHQIRLLVTEQDVTLYIDDQQIENKPL 

HPVLGILINGQTQIGKYSGKEETVQFDVQKLRIY 

CDPEQNNRETACEIPGFCLNGPSDVGSTPAPCICP 

PGKPGLQGPKGDPGLPGNPGYPGQPGQDGKPVS 

TESLVISGISGITGYQGIAGTPGVPGSPGIQGARGL 

PGYKGEPGRDGDK 


3763 


A 


3 


1267 


CKVWRNPLNLFRGAEYNRYTWVTGREPLTYYD 

MNLSAQDHQTFFTCDSDHLRPADAIMQKAWRE 

RM>QARISAAHEALEINECATAYILLAEEEATnA 

EAEKLFKQALKAGDGCYRRSQQLQHHGSQYEA 

QHSVLYLPLQ\TRHQCLGVHQKKASNVCQKTRE 

DQGSSENDERFNEGVPPSEYVQYP*KPF\KALLEL 

OAYADVOAVI.AKYDDTST PKSATTPYTAAI T 

RAVSDKFSPEAASRRGLSTAEMNAVEAIHRAVEF 

NPHVPKYLLEMKSLILPPEHILKRGDSEAIAYAFF 

HLAHWKRVEGALNLLHCTWEGTFRMIPYPLEKG 

HLFYPYPICTETADRELLPSFHEVSVYPKKELPFFI 

LFTAGLCSFTAMLALLTHQFPELMGVFAKAVSV 

CLEGGLGEAVMGKAKGIKAA 


3764 


A 


25 


1032 


RSADGLCGNKDRERGNEFTRNQQAAQEVVNPK 
KKMKKKKYVNSGTVTLLSFAVESECTFLDYIKG 
GTQINFTVAIDFTASNGNPSQSTSLHYMSPYQLN 
AYALALTAVGEIIOHYDSDKMFPALGFGAKLPPD 

GRVSHEFPLNGNQENPSCCGIDGILEAYHRSLRT 

VQLYGPTNFAPWTHVARNAAAVQDGSQYSVL 

LU11)GVISDMAQ1XEAIVNG\SKLPMSIIIVGVGQ 

AEFNAMVELDGDDVRISSRGKLAERDIVQFVPFR 

DYVDRTGNHVLSMARLARDVLAEIPDQLVSYM 

KAQGIRPRSPPAAPTHSPSQSPARTPPACPLHTHI 


3765 


A 


172 


3456 


LGMMDSPKIGNGLPVIGPGTDIGISSLHMVGYLG 
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S£QID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue or 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^Alanine C=Cystcine, D»Aspartic Acid, 
E^Glutamic Acid, f^Phenylalanine, G=Glycine, H^Histidine, 
Msoleucine, K=Lysjne, L»Leucinc, M=Metliionine, 
N^Asparagine, PsFroline, Q=Glutamine, R°Arginine, S^erine, 
T-Thrconlne, V=VaUne, W^Tryptophan, Y=Tyrosine, 
X«Unknowii, *«Stop codon, A=possible nucleotide deletion, 
>F=possible nucleotide insertion 










KNFDSAKVPSDEYCPACKEKGKLKALKTYRISFQ 

ESDPLCEDLQCIYPLGSKSLNNLISPDLEECHTPHK 

PQKRKSLESSYKDSLLLANSKKTRNYIAIDGGKV 

LNSKHNGEVYDETSSNLPDSSGQQNPIRTADSLE 

R]^ILEADT\a)MATTKDPATVDVSGTGRPSPQN 

EGCTSKLEMPLESKCTSFPQALCVQWKNAYALC 

WLDCBLSALVHSEELKNTVTGLCSKEESIFWULL 

TKYNQANTLLYTSQLSGVKDGDCKKLTSEIFAEI 

ETCLNEVUDEIFISLQPQLRCTLGDMESPVFAFPL 

LLKLETHIEKLFLYSFSWDFECSQCGHQYQNRH 

MKSLXnFTNVIPEWHPLNAAHFGPCNNCNSKSQI 

RKKIVLEKVSPIFMLHFVEGLPQNDLQHYAFHFE 

GCLYQITSVIQYRANNHFITWILDADGSWLECDD 

LKGPCSERHKKFEVPASEIHIVIWERKISQVTDKE 

AACLPLKKTNDQHALSNEKPVSLTSCSVGDAAS 

AETASVTHPKDISVAPRTLSQDTAVTHGDHLLSG 

PKGLVDNILPLTLEETIQKTASVSQLNSEAFL\LBN 

KPVAENTGILKTNTLLSQESLMASSVSAPCNEKLI 

QDQFVDISFPSQVWTNMQSVQLNTEDTVNTKS 

VNNTDATGLIQGVKSVEIEKDAQLKQFLTPKTEQ 

LKPER\nrSQVSNLKKKETTADSQTTTSKSLQNQS 

LKENQKKPFVGSWVKGLISRGASFMPLCVSAHN 

RNTITDLQPSVKGVNNFGGFKTKGINQKASHVSK 

KARKSASKPPPISKPPAGPPSSNGTAAHPHAHAA 

SEVLEKSGSTSCGAQLNHSSYGNGISSANHEDLV 

buQuHUsxRLKJ^KJCKLFLAEK^ 

VRSENLEQVPQDGSPNrXTESlEDLLNELPYPIDIA 

NESACTTVPGVSLYSSQTHEEILAELLSPTPVSTE 

LSENGEGDFRYLGMGDSHIPPPVPSEFNDVSQNT 

HLRQDHNYCSPTKKNPCEVQPDSLTNNACVRTL 

NLESPMKTDIFDEFFSSSALNALANDTLDLPHFDE 

YLFENY 


3766 


A 


3 

• 


1622 


AQQIVYKNVMLENYKNLVSLGYQLTKPDVILRL 

EKGEEPWLVEREIHQETHPDSETAFEIKSSVSSRSI 

FKDKQSCDIKMEGMAR>fDLWYLSLEEVWKCRD 

QLDKYQENPERHLRQVAFTQKKVLTQERVSESG 

KYGGNCLLPAQLVLREYFHKRDSHTKSLKHDLV 

LNGHQDSCASNSNECGQTFCQNIHLIQFARTHTG 

DKSYKCPDNDKSLTHGSSLGISKGIHREKPYECK 

ECGKFFSWRSNLTRHQLIHTGEKPYECKECGKSF 

SRSSHLIGHQKTHTGEEPYECKECGKSFSWFSHL 

THTGEKPYECPECGKSFRQSTHLILHQRTHVRVR 

PYECNECGKSYSQRSHLVVHHRIHTGLKPFECKD 

CGKCFSRSSHLYSHQRTHTGEKPYECHDCGKSFS 

QSSALIVHQRIHTGEKPYECCQCGKAFIRKNDLnC 

HQRIHVGEETYKCNQCGIIFSQNSPFIVHQIAHTG 

EQFLTCNQCGTALVNTSNLIGYQTNHIRENAY 


J / u/ 


A 
A 






AnniWRMVMT PWVK'MT VQT HVOT TK'PnVlT PI 
/vV^Vc' ^ ' IvIN V iYLLfOlN I JNINjU V OL>\J I V|[JL« 1 JvIrL' V JUUJVLi 

EKGEEPWLVEREIHQETHPDSETAFEIKSSVSSRSI 

FKDKQSCDEKMEGMARNDLWYLSLEEVWKCRD 

QLDKYQENPERHLRQVAFTQKKVLTQERVSESG 

KYGGNCLLPAQLVLREYFHKRDSHTKSLKHDLV 

LNGHQDSCASNSNECGQTFCQNIHLIQFARTHTG 

DKSYKCPDNDNSLTHGSSLGISKGIHREKPYECK 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

locatioB 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=AJanine C=Cysteinc, D^Aspartic Acid, 
E^Glutamic Acid, Fs^Ptienylalanine, G^Glycine, H^Histidine, 
]»Isoleucine, K^Lysine, L^Lencine, M=Methiooine, 
N»Asparagine, P^Proline, Q=Glutaniinc, R^Arginine, S=5erine, 
T=Threoninc, VaVaiInc, W-Tryptopban, Y«Tyrosine, 
X»Unknown, *BStop codon, A^posslble nodeotide deletion, 
V^possible nucleotide insertion 










ECGKFFSWRSNLTRHQLIHTGEKPYECKECGKSF 

SRSSHLIGHQKTHTGEEPYECKECGKSFSWFSHL 

VTHORTHTGDKLYTCNOCGKSFA^HSSRLIRHOR 

THTGEKPYECPECGKSFRQSTHLILHQRTHVRVR 

PYECNECGKSYSQRSHLWHHRIHTGLKPFECKD 

CGKCFSRSSHLYSHQRTHTGEKPYECHDCGKSFS 

QSSALIVHQRIHTGEKPYECCQCGKAFIRKNDLIK 

HQRIHVGEETYKCNQCGIIFSQNSPFIVHQIAHTG 

EQFLTCNQCGTALVNTSNLIGYQTNHIRENAY 


3768 


A 


185 


2258 

■ 


SIIIKMSRKISKESKKVNISSSLESEDISLETTVPTD 

DISSSEEREGKVRITRQLIERKELLHNIQLLKIELS 

QKTMMIDNLKVDYLTKIEELEEKLNDALHQKQL 

LTLRLDNQLAFQQKDASKYQELMKQEMETILLR 

QKQLEETNLQLREKAODVRRSLRDFELTEEQYIK 

LBCAFPEDQLSIPEYVSVRFYELVNPLRKEICELQV 

KKNIIjVEELSTNKNQLKQLTETYEEDRKNYSEV 

qircqrlaleladtkqliqqgdyrqenydkvks 

erdaleqevffilrrkheileashmiqtkerselsk 

ewtlbqtvtllqkdkeylnrqnmelsvrcahe 

edrlerlqaqleeskbcareemyekyvasrdhy 

kteyenklhdeleqirlktnqeidqlrnasremy 

erenrnlreardnavaekeravmaekdalekh 

dqlldryre\lq\lsteskvteflhqsklksfese 

fyslqassekritelqaqnsehqarldiyeklek: 
eldelimqtaeienedeaervlfsygyganvptr 
akrrlkqsvhlarrvlqlekqnsli/lkrsgtsk 

GPSNTAFTRSLTEANSLLNQTQQPYRYLIESVRQ 

RDSKIDSLTESIAQL/ERKDVSNLNKEKSALLQTN 

GIKMAL\DL\DQLLNHP 


3769 


A 


3 


2297 


DAAEFRVVADAMKVIGFKPEEIQTVYKILAAILH 

LGNLKFVVDGDTPLIENGKVVSIIAELLSTKTDM 

VEKALLYRTVATGRDIIDKQHTEQEASYGRDAF 

AKAIYERLFCWTVTRINDIIEVKNYDTTIHGKNTV 

IGVLDIYGFEIFDNNSFEQFCINYCNEKLQQLFIQL 

VLKQEQEEYQREGIPWKHIDYFNNQirVDLVEQQ 

HKGIIAILDDACMNVGKVTDEMFLEALNSKLGK 

HAHFSSRBCLCASDKILEFDRDFRIRHYAGDVVYS 

VIGFroKNKDTLFQDFKRLMYNSSNPVLKNMWP 

EGKLSITEVTKRPLTAATLFKNSMIALVDNLASK 

EPYYVRCIKPNDKKSPQIFDDERCRHQVEYLGLL 

ENVRVRRAGFAFRQTYEKFLHRYKMISEFTWPN 

HDLPSDKEAVKKLIERCGFQDDVAYGKTKIFIRT 

PRTLFTLEELRAQMLIRIVLFLQKVWRGTLARMR 

YKRTKAALTIIRYYRRYKVKSYIHEVARRFHGVK 

TlSdDRDYGKHVKWPSPPKVLRRFEEALQTIFNRWR 

ASOLIKSIPASDLPOVRAKVAAVEMLKGORADL 

GLQRAWEGNYLASKPDTPQTSGTFVPVANELKR 

KDKYNINVLFSCHVRKVNRFSKVEDRAIFVTDRH 

LYKMDPTKQYKVMKTIPLYNLTGLSVSNGKDQL 

WFHTKDNKDLIVCLFSKQFTHESRIGELWGVLV 

NHFKSEKRHLQVXNVTNPVQCSLHGKKCTVSVE 

TRLNQPQPDFTKNRSGFILSVPGN 


3770 


A 


3 


6276 


HKVAAPDVVVPTLDTVRHEALLYTWLAEHKPL 
VLCGPPGSGKTMTLFSALRALPDMEWGLNFSS 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue of 
peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, D^Aspartic Add, 
E=Glutainic Acid, F°Phenylalanine, G^^GIydne, H=Histidine, 
I^Isoleucine, K^Lysine, L^^Leudne, MaMethionine, 
N'^Asparagine, P^ProIInc, Q^GIutamine, R"Ai^inine, S=Serine, 
T=Threonin€, V=Valine, W— Tryptophan, Y-Tyrosine, 
X^Uoknovn, *=Stop codon, A=po5sibJc nncleotide deletion, 
\«pos$lbIe Qudeotide insertion 

* 










ATTPELLLKTFDHYCEYRRTPNGVVLAPVQLGK 

WLVLFCDEINLPDMDKYGTQRVISFIRQMVEHG 

GFYRTSDQTWVKLERIQFVGACNPPTDPGRKPLS 

HRJLI^HVPVVYVDYPGPASLTQIYGTFNRAMLR 

LIPSLRTYAEPLTAAMVEFYTMSQERFTQDTQPH 

YIYSPREMTRWVRGIFEALRPLETLPVEGLIRIWA 

HEALRLFQDRLVEDEERRWTDENDDTVALKHFP 

NIDREKAMSRPILYSNWLSKDYIPVDQEELRDYV 

KARLKYFYEEELDVPLVLFNEVLDHVLRIDRIFR 

QPQGHLLLIGA^SGAGKTTLSRFVAWMNGLSVYQ 

IKVHRKYTGEDFDEDLRTVLRRSGCKNEKJAFIM 

DESNVLDSGFLERMNTLLANGEVPGLFEGDEYA 

TLMTQCKEGAQKEGLMLDSHEELYKWFTSQVIR 

NLHVVFTMNPSSEGLKDRAATSPALFNRCVLNW 

FGDWSTEALYQVGKEFTSKMDLEKPNYIVPDYM 

PWYDKLPQPPSHREAIVNSCVFVHQTLHQANA 

RLAKRGGRTMAITPRHYLDFINHYANLFHEKRSE 

LEEQQIMHLIWGLRKJKETVDQVEELRRDLRIKS 

QEIEVKNAAANDKLKKMVKI^QEAEKKKVMS 

QEIQEQLHKQQEVIADKQMSVKEDLDKVEPAVI 

EAQNAVKSIKKQHLVEVRSMANPPAAVKLALES 

ICLIXGESTTDWKQIRSUMRENFIPriVNFSAEEIS 

DAIREK^fKKNYMSOTSYNYEIVNRASIACGP 

KLVAIAQLNYADMIXRVEPLRNELQKLEDDAKD 

NQQKANEVEQMIRDLEASIARYKEEYAVLISEAQ 

AIKADLAAVEAKVNRSTALLKSLSAERERWEKT 

SETFKNQMSTIAGDCLLSAAFIAYAGYFDQQMR 

QNLFTTV^SHHLQQANIQFRTDIARTEYLSNADER 

LRWQASSLPADDLCTENAIMLKRFNRYPLIIDPS 

GQATEFIMNEYKDRKITRTSFLDDAFRKNLESAL 

RFGNPLLVQDVESYDPVLNPVLNREVRRTGGRV 

LITLGDQDIDLSPSFVIFLSTRDPTVEFPPDLCSRV 

TWNFIYTRSSLQSQCLNEVLKAERPDVDEKRSD 

LLKLQGEFQLRLRQLEKSLLQALNEVKGRILDDD 

TliTTLENLIGUEAAEVTRKVEETDIVMQEVETVS 

QQYLPLSTACSSIYFTMESLKQIHFLYQYSLQFFL 

DIYHNVLYENPNLKGVTDHTQRLSIITKDLFQVA 

FNRVARGMLHQDHITFAMLLARIKLKGTVGEPT 

YDAEFQHFLRGNEIVLSAGSTPRIQGLTVEQAEA 

VYRLSCLPAFKDLIAKVQADEQFGIWLDS'SSPEQ 

TVPYLWSEETPATPIGQAIHRLLLIQAFRPDRLLA 

MAHMFVSTNLGESFMSIMEQPLDLTQFVGTEVKP 

NTPVLMCSVPGYDASGHVEDLAAEQNTQITSIAI 

GSAEGFNQADKAINTAVKSGRWVMLKNYHLAP 

GWLMQLEKKLHSLQPHACFRLFLTMEINPKVPV 

NILRAGRJFVFEPPPGVKANMLRTFSSIPVSRICK 

SPNfERARLYFLLAWFHAnQERLRYAPLGWSKKY 

EFGESDLRSACDTVDTWLDDTAKGRQNISPDKIP 

WSALKTLMAOSIYGGRVDNEFDORLLNTFLERL 

FTTRSFDSEFKLACKVDGHKDIQMPDGIRREEFV 

QWELLPDTQTPSWLGLPNNAERVLLTTQGVD 

MISKMLKMQMLEDEDDLAYAETEKKTRTDSTS 

DGRPUWl^rLHTTASNWLHLIPQrLSHLKRTVE 

NKDPLFRFFE\REVKMGAKLLQ\DVRQDLADV\V 

QVaEGKK:KQTTmJlTLI\NELV\KGELP\RSWSHY 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

toHrst amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A«Alanine C^Cysteine, D^Aspartic Acid, 
£=G]utamic Acid, F'^Pbenylalanine, G=Glycine, H«Hlstidtne, 
I=Isolcucine, K^Lysine, L^Leucine, M-Methionine, 
N^Asparagine, P^Proline, Q^Glutamine, R»Ai^nlne, SsScrinei 
T«Threonine, V«=Valine, W=Tryptophan, Y^Tyrosine, 
X=Unknown, ^'^^top codon, /-possible nucleotide deletion, 
V'possible nucleotide iosertion 










TVPAG\MTVIQWGVPISARRI\KQLQNISL\AAASG 

GAKELKNIHVCLGGLFVPEAYTTATRQYVAQAN 

SWSLEELCLEVNVTTSQGATLDACSFGVTGLKL 

QGATCNNNKLSLSNAISTALPLTQLRWVKQTNT 

EKKASWTLPVYLNFTRADLIFTVDFEIATKEDPR 

SFYERGVAVLCTE 


3771 


A 


1 


2043 


LPLLHAGFNRRFMENSSnACYNELIQlEHGEVRS 

QFKLRACNSVFTALDHCHEAIEITSDDHVIQYVN 

PAFERMMGYHKGELLGKELADLPKSDKNRADL 

LDTINTCIKKGKEWQGVYYARRKSGDSIQQHVKI 

TPVIGQGGKIRHFVSLKKLCCTTDNNKQIHKIHR 

DSGDNSQTEPHSFRYKNRRKESIDVKSISSRGSDA 

PSLQNRRYPSMARfflSMTIEAPITKVINIINAAQEN 

SPVTVAEALDRVLEILRTTELYSPQLGTKDEDPH 

TSDLVGGLMTDGLRRLSGNEYVFTKNVHQSHSH 

LAMPITINDVPPCISQLLDNEESWDFNIFELEAITH 

KRPLVYLGLKVFSRFGVCEFLNCSETTLRAWFQ 

VIEANYHSSNAYHNSTHAADVLHATAFFLGKER 

VKGSLDQLDEVAALIAATVHDVDHPGRTNSFL\C 

NAGSELAVLYNDT\AV\LESHHTALAFQ\LTVKDT 

KXCNIFKNID/RGNHYRTLROAIIDMVLATEMTKH 

FEHVNKFVNSINKPMAAEIEGSDCECNPAGKNFP 

ENQILIKRMMIKCADVANPCRPLDLCIEWAGRIS 

EEYFAQTDEEKRQGLPVVMPVFDRNTCSIPKSQI 

SFIDYFITDMFDAWDAFAHLPALMQHLADNYKH 

WKTLDDLKCKSLRLPSDRLKPSHRGGLLTDKGH 

CESQ 


3772 


A 


1013 


50 


TLVHADGFPSLHITETCLAYREKRIGIDLVHDTVE 

HELIKEAEIIQGDVIALLTRTLEEASEQIRNDSIRSAK 

YNLEKDLKDKFVALTEDDICFSLNNNSPNIRYSEN 

AVRIEPNSVSLEDWLDFSSTNVEKADKORJSTNSL 

MLKALVD\RILSQTANYLRKQCDWHTAFKNGL 

KDTKDARDQLADHLAKWMEEIASQEKNITALEK 

ADLDQEGPAKVAHTRLETRTHRPNVELCRDVAQ 

YRLMKEVQEITHNVARLKETLAVQAQAELKGLH 

RRQLALQEEIQVKENTIYIDEVLCMQMRKSIPLR 

DGEDHGVWAGGLRPDAVC 


3773 


A 


1 


955 


AAARESERQLRLRLCVLNEILGTERDYVGTLRFL 

QSAFLHRIRQNVADSVEKGLTEENVKVLFSNIEDI 

LEVHKDFLAALEYCLHPEPQSQHELGNVFLKFK 

DKFCVYEEYCSNHEKALRLLVELNKIPTVRAFLL 

SCMLLGGRKTTDIPLEGYL\LSPIQRICKYPLLLKE 

LAKRTPGKHPDHPAVQ\SALQAMKTVCSNINETK 

RQMEKLEALEAAA/QSHIEGWEGSNLTDICTQLL 

LQGTLLKISAGNIQERAFFLFDNLLVYCKRKSRV 

TGSKKSTKRTKSINGSLYIFRGRINTEVMEVENVE 

DGTGSPSPSLA 


3774 


A 


4254 


2061 


ELQGDFSVPDVPKSMAWCENSrCVGFKRDYYLI 

RVDGKGSIKELFPTGKQLEPLVAPLADGKVAVG 

QDDLTVVLNEEGICTQKCALNWTDIPVAMEHQP 

PYIIAVLPRYVEIRTFEPRLLVQSBELQRPRFITSGG 

SNIIYVASNHFVWRLPVPMATQIQQLLQDKQFE 

LALQLAEMKDDSDSEKQQQIHHIKNLYAFNLFC 

QKRPDESMQVFAKLGTDPTHVMGLYPDLLPTDY 

RKQLQYPNPLPVLSGAELEKAHLALIDYLTQKRS 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nudeoddc 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'^Alanine C=Cysteine, D»Aspartic Acid, 
E^Glutamic Acid, F^Ptienylalanine, G»Glycine, H°Histidine, 
I^Isoleuclne, K»Lysine, Lr^Leucine, M»Methionine, 
N">Aspanigin€, F^Proline, Q=KSlutamine, R»Arginlne, S=4erine, 
T=Threonlne, V'=Vaiine, W^Trypfophan, Y«=Tyrosine, 
X^Unlcnown, *»^top codon, /=*pQssibie nucleotide deletion, 
V^ossible nudeodde insertion 










QLVKKLNDSDHQSSTSPLMEGTPTIKSKKKLLQr 

DTTLLKCYLHTNVALVAPLLRLENMiCHIEESEH 

VLKKAHKYSELnLYEKKGLHEKALQVLVDQSK 

KANSPLKGHERTVQYLQHLGTENLHLIFSYSVW 

VLRDFPEDGLKDFTEDLPEVESLPRDRVLGFLIEN 

FKGLAIPYLEHIIHVWEETGSRFHNCLIQLYCEKV 

QGLMKEYLLSFPAGKTPVPAGEEEGELGEYRQK 

LLMFLEISSYYDPGRLICDFPFDGLLEERALLLGR 

MGKHEQALFIYVHILKDTRMAEEYCHKHYDRN 

KDGNKDVYLSLLRMYLSPPSIHCLGPIKLELLEPK 

ANLQAALQVLELHHSKLDTTKALNLLPANTQIN 

DIRIFLEKVLEENAQKKRFNQVLKNLLHAEFLRVV 

QEERILHQQVKCUTEEKVCMVCKKKIGNSAFAR 

YPNGVVVHYFCSXKEVNPADT 


3775 


A 


1832 


839 


MSRARGALCRACLALAAALAALLLLPLPLPRAP 
APARTPAPAPRAPPSRPAAPSLRPDDVFIAVKTTR 
KNHGPRLRLLLRTWUSRARQQTFIFTDGDDPELE 
LOGGDRVINTNCSAVRTROALCCKMSVEYDKFT 

JL^^r A V W AJL^ A A i^^kr#^ T A A^^^«AA>«^^^^A^mtA^7 T A J^XmA 

ESGRKWFCHVDDDNYVNARSLLHLLSSFSPSQD 

VYLGRPSLDHPIEATERVQGGRTVTTVnKFW 

GGAGFCLSRGLALKMSPWASLGSFMSTAEQVRL 

PDDCTVGYIVEGLLGARLLHSPLFHSHLENLQRL 

PPDTLLQQVTLSHGGPENPQNVVNVAGGFSLHQ 

DPTRFKSraCLLYPDTDWCPRQKQGAPTSR 


3776 


A 


3 


796 


PRAKLGTRARNMAGQDAGCGRGGDDYSEDEGD 
SSVSRAAVEVFGKLKDLNCPFLEGLYITEPKTIOE 

V V WAwA^^A T V A A^A^A^M^A^4 ^ ^^A A ^^^m^^Wmi AAA A^L A A\^ A^ 

LLCSPSEYRLEILEWMCTRVWPSLQDRFSSLKGV 

PTEVKIQEMTKLGHELMLCAPDDQELLKGCACA 

QKQLHUFMDQLLDTIRSLTIGCSSCSSLMEHFEDT 

EIEKNEALLGELFSSPHLQMLLNPECDPWPLDMQ 

PLLNKOSDDWOWASASAKSEEEEKLAELAROLO 

ESAAKLHALRTEYFAQHEQGAAAGAAMSAP 


3777 


A 


3 


413 


SEEDVIEGKTAVIEKRRJKKRSSAGVVED/IGGEVQ 

NMLEGVGVDINKALLAKRKRLEMYTKASLRTSN 

QKJEHVWKTQQDQRQKLNQEYSQQFLTLFQQW 

DLDMQKAEEQEEKILVGIMIRFIINQVSSRNGQPS 

LLL 


3778 


A 


132 


788 


SRLPPPPPHLADGRAGARVPRSARLSRWWVOD 

WTHGPIVRPPAAARTMWVNPEEVLLANALWITE . 

RANPYFILQRRKGHAGDGGGGGGLAGLLVGTLD 

VVLDSSARVAPYRILYQTPDSLVYWTIACG\GSR 

KEITEH^^^WLEQNLLQTLSIFENENDITTFVRGKI 

QGIIAEYNKINDVKEDDDTEKFKEAIVKFHRLFG 

MPEEEKLVNYYSCSYWKG 


3779 


A 


2 


934 


CKSCTLFPQNPNLPPPSTRERPPGCKTVFVGGLPE 

NATEEIIQEVFEQCGDITAIRKSKKNFCHIRFAEEF 

MVDKAIYLSGYRMRLGSSTDKKDSGRLHVDFA 

OARDDFYEWECKORMRARBERHRRKLEEDRLR 

PPSPPAIMHYSEHEAALLAEKLKDDSKFSEAM\Q 

VLLSW1ERGEVNRR\SANQFYSMVQSANSHVRRL 

MNEKATHEQEMEEAKENFKNALTGILTQFEQIV 

AVFNASTRQKAWDHFSKAQRKNIDIWAK\HSEE 

LRNAQSEQLMGIRREEEMEMSDDENCDSPTKKIVI 

RVDESALGAP 


3780 


A 


1 


2535 


AAQAEREELAAGRMPGGGPQGAPAAAGGGGVS 
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SEQlD 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucieotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alaninc C=Cysteine, D^Aspartic Acid, 
E=Glutainic Acid, F»Flienylalanine, G-Glycioe, H==Histidine, 
I=Isoicucine, K=Lysine, l/>Leiicine, M"Methionine, 
N»Asparagine, P=Frolinc, Q^Glutamine, R^Arglnine, S^Serine^ 
T°Tbreonine, V=Valinc, W^Tryptoplian, Y^Tyrosinc, 
X=i;nlcnown, *=^top codon, /^possible nucleotide ddetiofl, 
\»possible nucleotide insertion 










HRAGSRDCLPPAACFRRRRLARRPGYMRSSTGP 

GIGFLSPAVGTLFRFPGGVSGEESHHSESRARQC 

GLDSRGLLVRSPVSKSAAAPTVTSVRGTSAHFGI 

QLRGGTRLPDRLSWPCGPGSAGWQQEFAAMDS 

SETLDASWEAACSDGARRVRAAGSLPSAELSSNS 

CSPGCGPEVPPTPPGSHSAFTSSFSFIRLSLGSAGE 

RGEAEGCPPSREAESHCQSPQEMGAKAASLDGP 

HEDPRCLSQPFSLLATRVSADLAQAARNSSRPER 

DMHSLPDMDPGSSSSLDPSLAGCGGDGSSGSGD 

AHSWDTLLRKWEPVLRDCLLRNRRQMEVISLRL 

KLQKLQEDAVENDDYDKAETLQQRLEDLEQEKI 

SLHFQLPSRQPALSSFLGHLAAQVQAALRRGATQ 

QASGDDTHTPLRMEPRLLEPTAQDSLHVSITRRD 

WLLQEKQQLQKEIEALQARMFVLEAKDQQLRRE 

lEEQEQQLQWQGCDLTPLVGQLSLGQLQEVSKA 

LQDTLASAGQIPFHAEPPETIRSLQERIKSLNLSLK 

EITTKVCMSEKFCSTLRKKVNDIETQLPALLEAK 

MHAISGNHFWTAKDLTEEIRSLTSDREGLEGLLS 

KLLVLSSRNVKKLGSVKEDYNRLRREA^HOETA 

YETSVKENTMKYMETLKNKLCSCKCPLLGKVW 

EADLEACRLLIQCLQLQEARGSLSVEDERQMDD 

LEGAAPPIPPRLHSEDKRKTPLKESYILSAELGEK 

CEDIGKKLLYLEDQLHTAIHSHDEDLIQSLRRELQ 

MVKETLQAMILQLQPAKEAGEREAAASCMTAG 

VHEAQA 


3781 


A 


3 


995 


GRRRAGPAHSARMYNMMETELKPPGPQQTSGG 

GGGNSTAAAAGGNQKNSPDRVKRPMNAFMVW 

SRGQRRKMAQENPKMHNSEISKRLGAEWKLLSE 

TEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTK 

TLMKKDKYTLPGGLLAPGGNSMASGVGVGAGL 

GAGVNQRMDSYAHMNGWSNGSYSMMQDQLG 

YPQHPGLNAHGAAQMQPMHRYDVSALQYNSM 

TSSQTYMNG/SRPTYSMSYSQQGTPGMAPGSMVIG 

SVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMIS 

MYLPGAEVPEPAAPSRLHMSQHYQSGPVPGTAI 

NGTLPLSHM 


3782 


A 


1 

4 


2649 


FRVPDSCPVVLHSFTQLDPDLPRPESSTQEIGEELI 

NGVIYSISLRKVQLHHGGNKGQRWLGYENESAL 

NLYETCKVRTVKAGTLEKLVEHLVPAFQGSDLS 

YVTIFLCTYRAFTTTQQVLDLLFKRYGRCDALTA 

SSRYGCILPYSDEDGGPQDQLKNAISSILGTWLD 

QYSEDFCQPPDFPCLKQLVAYVQLNMPGSDLER 

RAHLLLAQLEHSEPIEAEPEGEEDWALSPVPALK 

PTPELELALTPARAPSPVPAPAPEPEPAPTPAPGSE 

LEVAPAPAPELQQAPEPAVGLESAPAPALELEPA 

PEQDPAPSQTLELEPAPAPVPSLQPSWPSPVVAEN 

GLSEEKPHLLVFPPDLVAEQFTLMDAELFKKWP 

YHCLGSIWSQRDKKGKEHLAPTIRATVTQFNSV 

ANCVITTCLGNRSTBCAPDRARWEHWIEVAREC 

RILKNFSSLYAILSALQSNSIHRLKKTWEDVSRDS 

FRIFQKLSEIFSDENNYSLSRELLIKEGTSKFATLE 

MNPKRAQKRPKETGnQGTVPYLGTFLTDLVML 

DTAMKDYLYGRLINFEKRRKEFEVIAQIKLLQSA 

CNNYSIAPDEQFGAWFRAVERLSETESYNLSCEL 

EPPSESASNTLRTKKNTAIVKRWSDRQAPSTELS 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'=Alanine C=Cysteine, D»Aspartic Acid» 
£'=<;iutanilc Acid, F^Phenylalanine, GM^Iycine, H^Mistidinc, 
I«IsoIeucine, K=Lysine, L^Leucine, M'^Metliioniiie, 
N^'Asparagine, P^Proline, Q^GIutamine, RaAi^ginine, S==Serine, 
T=Tlirconine, V^^Valine, W»Tryptoplian, Y^Tyrosinc, 
X>=llnknowo, *=Stop codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion 










TSGSSHSKSCDOLRCGPYLSSGDIADALSVHSAG 

SSSSDVEEINISFVFESPDGQEKKFWESASQSSPET 

SGISSASSSTSSSSASTTPVAATRTHKRSVSGLCNS 

SSALPLYNQQVGDCCIIRVSLDVDNGNMYKSILV 

TSQDKAPAVIRKAMDKHNLEEEEPEDYELLQILS 

DDRKLKIPENANVFYAMNSTANYDFVLKKRTFT 

KGVKVKHGASSTLPRMKQKGLKIAKGIF 


3783 


A 


3 


869 


RSGQGKVYGLIGRRRFQQMDVLEGLNLLITISGK 

Rl^RVVTLSWLRNKILHNDPEVEKICQGWTTV 

GDMEGCGHYRVVKYERTKFLVIALKSSVEVYAW 

APKPYHKFMAFKSFADLPHRPLLVDLTVEEGQR 

LKVIYGSSAGFHAVDVDSGNSYDIYIPVHIQSQIT 

PHAIIFLPNTDGMEMLLCYEDEGVYVNTYGRIIK 

DWLQWGEMPTSVAYICSNQIMGWGEKAIEIRS 

VETOHLDGWMHKRAQRLKFLCEKNDKVFFASV 

RSGGSSQYYFMTLNRNCIMNW 


3784 


A 


1213 


457 


LSPRQVDGLAGLQKGLSLSLLYQFLMNGIRLGTY 

AYLGSPIYMVKTHLQAQAASEIAVGHQYBCHQG 

MFQALTEIGQKHGLVGLWRGALGGLPRVIVGSS 

TQLCTFSSTKDLLSQWEIFPPQSWKLALVAAMM 

SGIAWLAMAPFDVACTRLYNQPHRCTGQGP\LY 

RGILDALLQTARTEGEFGMYKGIGASYFRLGPHTI 

LSLFFWDQLRSLYYTDTK 


3785 


A 


193 


813 


SKHYVYIINVTWSDSTSQTIYRRY\SKFFDLQMQL 

LD\KFPI\ESGQKDPKQRIIPFLPGKILFRRSHIRDV 

AVKRLKPIDEYCRALVRLPPHISQCDEVFRFFEAR 

PEDVNPPKEQGPSPPDAVLPYGVNKGKQELKAG 

PNWPGRTHHVVNCVTQKCLFVFHFKFSSSGNnKE 

SKSL 


3786 


A 


3785 


1632 


EFVGRAASTTVVTRIAWRMADAGIRRVVPSDLY 

PLVLGFLRDNQLSEVANKFAKATGATQQDANAS 

SLLDIYSFWLNRSAKVPERKLQANGPVAKKAKK 

KASSSDSEDSSEEEEEVQGPPAKKAAVPAKRVGL 

PPGKAAAKASESSSSEESSDDDDEEDQKKQPVQ 

KGVKPQAKAGQAPPKKAKSSDSDSDSSSEDEPP 

KNQKPKITP\VTVKAQTKAPPKPARA\APKIANGK 

AASSSSSSSSSSSSDDSEEEKAAATPKKTVPKKQV 

VABCAPVKAATTPTRKSSSSEDSSSDEEEEQBCKPM 

KNKPGPYSSVPPPSAPPPKKSLGTQPPKKAVEKQ 

QPVESSEDSSDESDSSSEEEKKPPTKAWSKATTK 

PPPAKXAAESSSDSSDSDSSEDDEAPSKPAGTTK 

NSSNKPAVTTKSPAVKPAAAPKQPVGGGQKLLT 

RKADSSSSEEESSSSEEEKTKKMVATTKPKATAK 

AALSLPAKQAPQGSRDSSSDSDSSSSEEEEEKTSK 

SAVKKKPOKVAGGAAPSKPASAKJCGKAESSNSS 

SSDDSSEEEEEKLKGKGSPRPQAPKANGTSALTA 

QNGKAAKNSEEEEEEKKKAAVVVSKSGSLKKR 

KQNEAAKEAETPQAKKIKLQTPNTFPfCRKKGEK 

RASSPFRRVREEEIEVDSRVADNSFDAKRGAAGD 

WGERANQVLKFTKGKSFRHEKTKKKRGSYRGG 

SISVQVNSIKFDSE 


3787 


A 


3 


5078 


IPEG/RALSAEHTSSLVPSLHITTLGQEQAILSGAV 
PASPSTGTADFPSILTFLQPTENHASPSPVPEMPTL 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^Alanine C=Cysteine, D^Aspartic Acid, 
E>=Glutamic Acid, F^Phenylalanine, G=Glycine, H^Histidine, 
I^Isoleucine, K=Lysine, D=Leucinc, M=IVIetbiotiine, 
r^Asparagine, P^ProIine, Q^GIutamine, BpArgiaine, S«Serine, 
T^Threonine, V«Vaiine, WaTryptophan, Y=Tyrosinc, 
XsUnlcnowiij *=Stop codon, /^possible nucleotide deletion, 
\=5pos5ible nucleotide insertion 


• 








PAEGSDGSPPATRDLLLSSKVPNLLSTSWTFPRW 

KKDSVTAILGKNEEANVTIPLQAFPRKEVLSLHT 

VNGFVSDFSTGSVSSPnTAPRTNPLPSGPPLPSILS 

IQATQTVFPSLLAFSSTKPEVYAAAVDHSGLPAS 

APKQVRASPSSMDVYDSLTIGDMKKPATTDVFW 

SSLSAETGSLSTESIISGLQQQTNYDLNGHTISTTS 

WETHLAPTAPPNGLTSAADAIKSQDFKDTAGHS 

VTAEGFSIQDLVLGTSIEQPVQQSDMTMVGSHID 

LWPTSNNNHSRDFQTAEVAYYSPTTRHSVSHPQ 

LQLPNQPAHPLLLTSPGPTSTGSLQEMLSDGTDT 

GSEISSDINSSPERNASTPFQNILGYHSAAESSISTS 

VFPRTSSRVLRASQHPKKWTADTVSSKVQPTAA 

AAVTLFLRKSSPPALSAALVAKGTSSSPLAVASG 

PAKSSSMTTLAKNVTNKAASGPKRTPGAVHTAF 

PFTPTYMYARTGHTTSTHTA/IARKHGHCLWPVV 

YNLP/PP/GKPQAMHTGLPNPTNLEMPRASTPRPL 

TVTAALTSITASVKATRLPPLRAENTDAVLPAAS 

AAVVTTGKMASNLECQMSSKLLVKTVLFLTQRR 

VQISESLKFSIAKGLTQALRKAFHQNDVSAHVDI 

LEYSHNVTVGYYATKGKLVYLPAWIEMLGVY 

GVSNVTADLKQHTPHLQSVAVLASPWNPQPAG 

YFQLKTVLQFVSQADNIQSCKFAQTMEQRLQKA 

FQDAERKVLNTKSNLTIQIVSTSNASQAVTLVYV 

VGNQSTFLNGTVASSLLSQLSAELVGFYLTYPPL 

TIAEPLEYPNLDISETTRDYWVITVLQGVDNSLV 

GLHNQSFARVMEQRLAQLFMMSQQQGRRFKRA 

rnLGSYTVQMVKMQRVPGPKDPAELTYYTLYN 

GKPLLGTAAAOLSTIDSQRMALTLHHWLLQAD 

PVVKNPPNNLWnAAVLAPIAVVTVIIIIITAVLCR 

KNKNDFKPDTMINLPQRAKPVQGFDYAKQHLG 

QQGADEEVIPVTQETVVLPLPIRDAPQERX)VAQD 

GSTIKTAKSTETKKSRSPSENGSVISNESGKPSSGR 

RSPQNVMAQQKVTKEEARKRNVPASDEEEGAV 

LFDNSSKVAAEPFDTSSGSVQLIAIKPTALPMVPP 

TSDRSQESSAVLNGEVNKALKQKSDIEHYRNKL 

RLKAKRKGYYDFPAVETSKGLTERKKMYEKAP 

KEMEHVLDPDSELCAPFTESKNRQQMKNSVYRS 

RQSLNSPSPGETEMDLLVTRERPRRGIRNSGYDT 

EPEIIEETNEDRVPEPRGYSRSRQVKGHSETSTLSS 

QPSIDEVRQQMHMLLEEAFSLASAGHAGQSRHQ 

EAYGSAQHLPYSEVVTSAPGTMTRPRAGVQWVP 

TYRPEMYQYSLPRPAYRFSQLPEMVMGSPPPPVP 

PRTGPVAVASLRRSTSDIGSKTRMAESTGPEPAO 

LHDSASFTQMSRGPVSVTQLDQSALNYSGNTVP 

AVFAIPAANRPGFTGYFIPTPPSSYRNQAWMSYA 

GENELPSQWADSVPLPGYIEAYPRSRYPQSSPSRL 

PRQYSQPANLHPSLEQAPAPSTAASQQSLAENDP 

SDAPLTNISTAALVKAIREEVAKLAKKQrDMFEF 

QV 


3788 


A 


2 


1737 


MKGLYTDAEMKSDNVKDKDAKISFLQKAIDVV 

VMVSGEPLLAKPARIVAGHEPERTNELLQnGKC 

CLNKLSSDDAVRRVLAGEKGEVKGRASLTSRSQ 

ELDNKNVREEESRVHKNTEDRGDAEIKERSTSRD 

RKQKEELKEDRMPREKDKDKEKAKENrGG>fRHR 

EGERERAKARARPDNERQKDRGNRERDRDSERK 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=*Alanine C^Cysteine, D«Aspartic Acid, 
E=Glntamic Acid, F=Phcnylalanine, G=Glyclne, H^Histidiae, 
I^^Isoleucine, K=Lysine, L=Leuciae, M^Metliionine, 
N^Asparagine, P=Proline, Q=<;iutamiae, R^Arginine, S^erioe, 
T"Tbreonine, V=Valine, W-Tryptophan, Y=a>rosine, 
X=Unknown, *=Stop codon, A=possible nucleotide deletion, 
\«pQSsible audeotJde insertion 










KETERKSEGGKEKERLRDRDRERDRDKGKDRDR 

RRVKNGEHSWDLDRENNREHDKPEKKSASSGE 

MSKKLSDGTFKDSKAETETEISTRASKSLTTKTS 

KRRSKNSVEGDSTSDAEGDAGPAGQDKSEVPET 

PEIPNELSSNIRRJPRPGSARPAPPRVKRQDSMEAL 

APQLSEMSEmMVTAVELEEEEKHGGLVKKDLET 

KKDYEKLQQSPKPGEKERSLFESAWKKEKDIVS 

KEmKLRTSIQTLCKSALPLGKJMDYIQEDVDAM 

QNELQM\YHSENRQHAEALQQEQRITDCAVEP\L 

KAELA\ELEQLIKD\Q\QDKICAVKANILKNEEKIQ 

KMVYSINLTSRR 


3789 


A 


1 


4369 


MRTLGTCLATLAGLLLTAAGETFSGGCLFDEPYS 

TCGYSQSEGDDFTSrWEQVNTLTKPTSDPWMPSGS 

FMLVNASGRPEGQRAHLLLPQLKENDTHCIDFH 

YFVSSKSNSPPGLLNVYVKVNNGPLGNPIWNISG 

DPTRTWNRAELAISTFWPNFYQVIFEVITSGHQG 

YLAIDEVKVLGHPCTRTPHFLRIQNVEVNAGQFA 

TFQCSAIGRTVAGDRLWLQGIDVRDAPLKEIKVT 

SSRRFIASFNVVTmTCRDAGKYRCMI\RTEGGVGI 

SNYAELWVKEPPVPIAPPQLASVGATYLWIQLN 

ANSINGDGPIVAREVEYCTASGSWNDRQPVDSTS 

YKIGHLDPDTEYEISVLLTRPGEGGTGSPGPALRT 

RTKCADPMRGPRKLEVVEVKSRQITIRWEPFGY 

NVTRCHSYNLTVHYCYQVGGQEQVREEVSWDT 

ENSHPQHTITNLSPYTNVSVKLILMNPEGRKESQ 

ELIVQTDEDLPGAVPTESIQGSTFEEKIFLQWREP 

TQTYGVITLYEITYKAVSSFDPEIDLSNQSGRVSK 

LGNETHFLFFGLYPGTTYSFTIRASTAKGFGPPAT 

NQFTTKISAPSMPAYELETPLNQTDNTVTVMLKP 

AHSRGAPVSVYQIWEEERPRRTKKTTEILKCYP 

VPIHFQNASLLNSQYYFAAEFPADSLQAAQPFTIG 

DNKTYNGYWNTPLLPYKSYRIYFQAASRANGET 

KIDCVQVATKGAATPKPVPEPEKQTDHTVKIAG 

VIAGILLFVIIFLGVVLVMKKRKLXAKKPlKErMSS 

TRQEIDLWIGELNGPRSYAEQGTKLATRAFSFMD 

THNLNGRSVSSPSSFTMKTNTLSTSVPNSYYPDE 

THTMASDTSSLVQSHTYKKREPADVPYQTGQLH 

PAIRVADLLQHITQMKCAEGYGFKEEYESFFEGQ 

SAPWDSAKKDENRMKNRYGNIIAYDHSRVRLQT 

lEGDTNSDYINGNYIDGYHRPNHYIATQGPMQET 

lYDFWRMVWHENTASIIMVTNLVEVGRVKCCK 

YWPDDTEIYKDIKVTLIETELLAEYVIRTFAVEKR 

GVHEIREIRQFHFTGWPDHGVPYHATGLLGFVR 

QVKSKSPPSAGPLWHCSAGAGRTGCFIVIDIML 

DMAEREGVVDIYNCVRELRSRRVNMVQTEEQY 

VFIHDAILEACLCGDTSVPASQVRSLYYDMNKLD 

PQTNSSQIKEEFRTLNMVTPTLRVEDCSIALLPRN 

HEKNRCMDILPPDRCLPFLITIDGESSNYINAALVI 

DSYKQPSAFIVTQHPLPNTVKDFWRLVLDYHCTS 

VVMLNDVDPAQLCPQYWPENGVHRHGPIQVEF 

VSADLEEDnSRIFRIYNAARPQDGYRMVQQFQFL 

GWPMYRDTPVSKRSFLKLIRQVDKWQEEYNGG 

EGRTWHCLNGGGRSGIFCAISIVCEMLRHQRTV 

DWHAVKTLRNNKPNMVDLLDQYKFCYEVALE 
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SEQKD 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last aniDO 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C^Cysteine, D^Aspartic Acid, 
E^Glutamic Acid» F==Phenyl alanine, G=Glycine, H^Histidine, 
iBlsoleucine, K-Lysine, L^Leucine, M^Methioniiie, 
N»A5paragine, P^roline, Q-Glutamine, RaArginine, S^erloe, 
l^Threonine, V=Vallne, W»Tryptophan, Y«Tyraslne, 
X=Unkno>vn, *=>Stop codon, /^possible nucleotide deletion, 
V'Bpossible nucleotide insertion 










YLNSG 


3790 


A 


261 


485 


EEQTFLHJASRLGKraVQLLLQHMAHPDAATTN 
GYTPLHISAREGQV\DV\ASVLLGRQGAAHSFRLT 
KVRRMTS 


3791 


A 


1 


5874 


lppvtmsgkyimeehdsysdqvwsidelpskqg 

yylqgnylrcvaevgsfehnlttdllnhlvfvq 

kvfmkevneyiqkvsggeqpiplwnehdgtadg 

dkpkillyslnlqfkgiqvtattpsmravrfetg 

lielelsnrlqtkaspgsssylklfgkcqvdlnl 

algqivkhqvyeeagsdfhqvayfktriglrna 

lreeisgssdreavlitlnrpiwaqpvafdravl 

fwlnyk\aaydnwhnbqrmalhkdihmatkevv 

dmlpgiqqtsaqafgtpflqltvndlgiclpitnt 

aqsnhtgdldtgsalvltiestlitacsseslvsk: 

ghfknfcirfadgfetswddwkpeihgdlvmna 

CWPDGTYEVCSRTTGQAAAESSSAGTWTLNVL 

WKMCGIDVHMDPNIGKRLNALGNTLTTLTGEED 

roDIADLNSVNIADLSDEDEVDTMSPTIHTEATDY 

RRQAASASQPGELRGRKIMKRIVDIRELNEQAKV 

IDDLKKLGASEGTINQEIQRYQQLESVAVNDIRR 

DVRKKLRRSSMRAASLKDKWGLSYKPSYSRSKS 

ISASGRPPLKRMERASSRVGETEELPEIRVDAASP 

GPRVTFNIQDTFPEETELDLLSVTIEGPSHYSSNSE 

GSCSVFSSPKTPGGFSPGIPFQTEEGRRDDSLSSTS 

EDSEKDEKDEDHERERFYIYRKPSHTSRKKATGF 

AAVHQLFTERWPTTPVNRSLSGTATERNIDFELD 

IRVEIDSGKCVLHPTTLLQEHDDISLRRSYDRSSR 

SLDQDSPSKKKKFQT>ryASTTHLMTGKKVPSSL 

QTKPSDLETTVFYIPGVDVKLHYNSKTLKTESPN 

ASRGSSLPRTLSKESKLYGMKDSATSPPSPPLPST 

VQSKTNTLLPPQPPPIPAAKGKGSGGVKTAKLYA 

WVALQSLPEEMVISPCLLDFLEKALETIPITPVER 

NYTAVSSQDEDMGHFEIPDPMEES\TTSLVS\SSTS 

AYSSFPVDVVVYVRVQPSQIKFSCLPVSRVECML 

KLPSLDLVFSSNRGELETLGTTYPAETLSPGGNA 

TQSGTKTSASKTGIPGSSGLGSPLGRSRHSSSQSD 

LTSSSSSSSGLSFTACMSDFSLYVFHPYGAGKQIT 

AVSGLTPGSGGLGNYDEEPTSVTGRKDSLSINLE 

FVKVSLSRJRRSGGASFFESQSVSKSASKMDTTLI 

NISAVCDIGSASFKYDMRRLSEILAFPRAWTRRSI 

ARRLFLGDQTINLPTSGPGTPDSIEGVSQHLSPESS 

RKAYCKTWEQPSQSASFTHMPQSPNVFNEHMTN 

STMSPGTVGQSLKSPASIRSRSVSDSSVPRRDSLS 

KTSTPFNKSNKAASQQGTPWETLVVFAINLKQL 

NVQMNMSNVMGNTTWTTSGLKSQGRLSVGSNR 

DREISMSVGLGRSQLDSKGGWGGTTDVNALEM 

VAHISEHPNQQPSHKIQITMGSTEARVDYMGSSIL 

MGIFSNADLKLQDEWKVNLYNTLDSSITDKSEIF 

VHGDLKWDIFOVMISRSTTPDLTICIGMKLOEFFT 

QQFDTSKRALSTWGPVPYLPPKTMTSNLEKSSQE 

QLLDAAHHRHWPGVLKWSGCHISLFQIPLPEDG 

MQFGGSMSLHGNHMTLACFHGPNPRSKSWALF 

HLEEPMAFWTEAQKIWEDGSSDHSTYIVQTLDF 

HLGHNTMVTKPCGALESPMATITKITRRRHENPP 

HGVASVKEWFNYVTATRNEELNLLRNVDANNT 
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SEQJD 
NO: 


Method 


Predicted 

beginning 

nucieotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucieotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'^'Alanine C=Cysteiae, I>=Aspartic Acid» 
E>==Glutainic Add, F»Pbenyialanine, G^lycine, H-Hlstidine, 
1-Isoleucine, K=Lysinei L^Leucine, M-Methionioe, 
N»Asparagine, P»Proliae, Q==^lutamine, R»Arginine, S^Serine, 
T^^reonine, V"Valine, W^ryptophan, Y°Tyrosine, 
X=Unluiown, "^^Stop codon, possible nucleotide deletion, 
\»possible nucleotide insertion 










ENSTTVKNSSLLSGFRGGSSYNHETETTFAI PRM 

QLDFKSIHVQEPQEPSLQDASLKPKVECSWTEF 

TDHICVTMDAELIMFLHDLVSAYLKEKEKAIFPP 

RILSTRPGQKSPIIIHDDNSSDKDREDSITYTTVDW 

RDFMCNTWHLEPTLRLISWTGRKIDPVGVDYILO 

KLGFHHARTTIPKWLQRGVMDPLDKVLSVLIKK 

LGTALQDEKEKKGKDKEEH 


3792 


A 


1 


364 


QNGSTPLHHAASK>JRHEIALMLLEGGANPDGKD 
HYEATAKHOATAKGNFICMIHILLYYKASTITODT 
EGNTPPHLVCD\RVEEAKLLVSQGA/SIYIENKEE 
KDP/LQVAKGALGLVLKRMVEG 


3793 


A 


2 


340 


DDIQFHHFSSEEALQKVKYFVAKEDPSSQEEAHT 
PEAPPPQPPSSERCLGEMKCTLVRGDSSPRQAEL 

KSGPASRPAL 


3794 


A 


421 


158 


SYWVGEDYTYKFFEVILIDPFHKAIRRNPDTQWI 
SKAVYKHREMCGLTSTGRKSHGLEKDRMFPHAI 
GGSCRA A ♦RRRKTLOFPC YH 


3795 


A 


24 


592 


GGMDSRVSGTTSNGETKPVYPVMEKKEEDGTLE 

RGHWNNKMEFVLSVAGEIIGLGNVWRFPYLCYK 

NGGGAFFIPYLVFLFTCGIPVFLLETALGQYTSQG 

GVTAWRKICPIFEGIGYASQMIVBLLNVYYnVLA 

WALFYLFSSFTIDLPWGGCYHEWNTEHCMEFQK 
TNGSLNGTSEN A TSPVTFF W 


3796 


A 


3 


592 


KPASTYSTSQPSMAPLLPIRTLPLILILLALLSPGA 

ADFNISSLSGLLSPALTESLLVALPPCHLTGGNAT 

LMVRRANDSKVVTSSFVVPPCRGRRELVSVVDS 

GAGFTVTRLSAYQVTNLVPGTKFYISYLVKKGT 

ATESSREIPMFTLPRRNMESIGLGMARTGGMVVI 

TYLLSVAMFLLVLGFIIALALGSRK 


3797 


A 


1 


1556 

* 


ATRLLRGSGSWGCSRLRFGPPAYRRFSSGGAYPN 

IPLSSPLPGVPKPVFATVDGQEKFETKVTTLDNGL 

RVASQNKFGQFCTVGILINSGSRYEAKYLSGIAH 

FLEKLAFSSTARFDSKDEILLTLEKHGGICDCQTS 

RDTTMYAVSADSKGLDTVVALLADVVLQPRLT 

DEEVEMTRMAVQFELEDLNLRPDPEPLLTEMIHE 

AAYRENTVGLHRFCPTENVAKINREVLHSYLRN 

YYTPDRMVLAGVGVEHEHLVDCARKYLLGVQP 

AWGSAEAVDIDRSVAQYTGGIAKLERDMSNVSL 

GPTPrPELTHTMVGLFSCSFT FEDFTPFAVT NMMM 

GGGGSFSAGGPGKGMFSRLYLNVLNRHHWMYN 

ATSYHHSYEDTGLLCIPIASADPRQVREMVEnTK 

EFILMGGTVDTVELERAKTQLTSMLMMNLESRP 

VIFEDVGRQVLATRSRKLPHELCTLIRNVKPEDV 

KRVASKMLRGKPAVAALGDLTDLPTYEHIQTAL 

SSKDGRLPRTYRLFR 


3798 


A 


73 


759 


KRLVEAGVPRTFDGIVGEGGAOSRSCWPWGVTA 

QTPAFSADSLNCLKNCMSITMGSVRPSVEQFHKY 

LPWFLNDRPNIKCPKGGLAAYSTSVNLTSDGOV 

LASRFMAYHKPLKNSQDYTEALRAARELAANIT 

ADLRKVPGTDPAFEVFPYTITNVFYEQYLTTLPEG 

LFMLSLCLVPTFAVSCLLLGLDLRSGLLNLLSrV 

MILVDTVGFMALWGISYNAVSLINLVS 


3799 


A 


73 


759 


KRLVEAGVPRTFDGIVGEGGAQSRSCWPWGVTA 
QTPAFSADSLNCLKNCMSITMGSVRPSVEQFHKY 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptloe 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
se()uence 


Amino acid sequence (A^AIanine OrCysteine, I>Aspartic Acid, 
E<=Glutamic Acid, F^^^Phenylalanine, G=Glycine, H=Histidine, 
I«Isoleacine, K=Lysine, L^Leucine, M^Methionine, 
N=Asparagine, PaProUnc, Q=Glutaniine, RBArginine, SsSerine, 
T^Threonine, V»Vallne, W»Tryptophan, YaTyroslne, 
X°Unknown, *'=Stop codon, /-possible nucleotide deletion, 
V=possible nucleotide insertion 










LPWFLNDRPNIKCPKGGLAAYSTSVNLTSDGQV 

LASRFMAYHKPLKNSQDYTEALRAARELAANIT 

ADLRKVPGTDPAFEVFPYTITNVFYEQYLTILPEG 

LFMLSLCLVPTFAVSCLLLGLDLRSGLLNLLSIV 

MILVDTVGFMALWGISYNAVSLINLVS 


3800 


A 


250 


1032 


GIFRSLRVLFPLFSVGRPQFARSLSAAPQLSDTAD 

TMGFGDLKSPAGLOVLNDYLADKSYIEGYVPSO 

ADVAVFEAVSSPPPADLCHALRAVYNHIKSYEKE 

KASLPGVKKALGKYGPADVEDTTGSGATDSKD 

DDDIDLFGSDDEEESEEAKRLREERLAQYESKKA 

KKPALVAKSSILLDVKPWDDETOMAKLEECVRS 

lOADGLVWGSSKLVPVGYGIKKLOIOCVVEDDK 

VGTOMLEEQITAFEDYVQSMDVAAFNKI 


3801 


A 


155 


656 


SREMELVTFRDVADBFSPEEWKCLDPAQQNLYR 

DVMLENYRNLVSLGFVISNPDLVTCLEQIKEPCN 

LKIHETAAKPPAICSPFSQDLSPVQGIEDSFHKLIL 

KRYEKCGHENLQLRKGCKRVNECKVQKGVNNG 

VYQCLSTTQSKIFQCNTCVRVFSTSSHSNKHK 


3802 


A 


1 


1428 


VTVSPETHMDLTKGCVTFEDIAIYFSQDEWGLLD 

EAQRLLYLEVMLENFALVASLGCGHGTEDEETP 

SDQNVSVGVSQSKAGSSTQKTQSCEMCVPVLKD 

ILHLADLPGQKPYLVGECTNHHQHQKHHSAKKS 

LKRDMDRASYVKCCLFCMSLKPFRKWEVGKDL 

PAMLRLLRSLVFPGGKKPGTITECGEDIRSQKSH 

YKSGECGKASRHKHTPVYHPRVYTGKKLYECSK 

CGKAFRGKYSLVQHQRVHTGERPWECNECGKF 

LVDHQKIHTGARPYECSQCGKSFSQKATLVKHQ 

RVHTGERPYKCGECGNSFSQSAILNQPIRRIHTGA 

KPYECGQCGKSFSQKATLIKHQRVHTGERPYKC 

GDCGKSFSQSSILIQHRRIHTGARPYECGQCGKSF 

SOKSGLIOHOVVHTGERPYECNKCGNSFSOCSSL 

IHHQKCHNT 


3803 


A 


193 


617 


LFPFLGSESKNGEADSSDKEMKHGQKSPTGKQTS 

OHLKRLKKSGLGHLKWTKAEDIDIETPGSILVNT 

NLRALINKHTFASLPQHFQQYLLLLLPEVDRQMG 

SDGILRLSTSALNNEFFAYAAQGWKQRLAEGKF 

VFSIIM 


3804 


A 


197 


479 


SSSRASPPEHPSSOAHCGPLVLSHACPEVTNKWS 
TGSSSSPNSS WVSSPLQPEGLSG SSRMKGGS ATKI 
LLETLLLAAHMTADQGIASSQRCLL 


380S 


A 


1 


385 


QSADTLFPGDINFNVSGLFSAVTLQDTVSDRLAS 
EELPSTAVPTPATTPAPAPAPAPATAPALVSAAT 
BCERTESEVPPRPASPKVTRSPPETAAPVEDMARR 

SELAVGGEEGTEGGRGEGTGSPMSSY 


3806 


A 


47 


1033 


LQGDTWHLSFLSHFSRLHGGVPGRGLLEGNLLQ 

PQAPGHDMTSIPFPGDRLLQVDGVILCGLTHKQA 

VQCLKGPGQVARLVLERRVPRSTQQCPSANDSM 

GDERTAVSLVTALPGRPSSCVSVTDGPKF*SSN* 

KRIANGLGFSFVQMEKESCSHLKSDLVRIKRLFP 

GHPAEENGAIAAGDIBLGREWEGPRKASSSRCRG 

SWAMQLSVQAGPSFASYYPAAVEVLHLLRGAPQ 

EVTLLLCRPPPGALPELEQEWQTPELSADKEFTR 

ATCTDSCTSPILGSRGQLGGTVPPQMQGKAWGL 

RPESSQKAIREGTMGAKTERDLGPVP 
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1 SEQID 
NO: 


Method 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to Urst amino 
acid residue of 
peptide 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A==Alanine C^Cysteine, D=A3partic Acid, 
£=Glutamic Acid, F=Phenyialanine, G==Glycine, H^Histidine, 
l=Isoleucine, K»Lysine, L^Leucine, M-Methionine, 
N^Asparagine, F^Proline, Q=Glatamlne, R^^Arginine, &==Serine, 
T=Threonine, V-Valine, W=Tryptophan, Y=Tyrosine, 
X»Unknown, *»$top codon, /^possible nucleotide deletion, 
V^possible nucleotide insertion 


3807 


A 


656 


1238 


RCPSLLPPSWPLPTLQTLTRTPGNKAIAGGAGLW 

AVLWGSEIITPPYR*GN*NQRGAVPCLRPHRLRP 

QDKFLVLASDGLWDMLSNEDVVRLWGHLAEA 

DWHKTDLAQRPANLGLMQSLLLQKKASGLHEA 

DQNAATRLIRHAIGNNEYGEMEAERLAAMLTLP 

EDLARMYRDDITVTVVYFNSESIGAYYKGG 


3808 


A 


26 


2195 


SQYSESVAGRQASPERLLGSYHAMASTVEGGDT 

ALLPEFPRGPLDAYRARASFSWKELALFTEGEG 

MLRFKKTIFSALENDPLFARSPGADLSLEKYREL 

NFLRCKRIFEYDFLSVEDMFKSPLKVPALIQCLG 

MYDSSLAAKYLLHSLVFGSAVYSSGSERHLTYIQ 

KIFRMEIFGCFALTELSHGSNTKAIRTTAHYDPAT 

EEFIIHSPDFEAAKFWVGNMGKTATHAWFAKL 

CVPGDQCHGLHPFIVQIRDPKTLLPMPGVMVGDI 

GKKLGQNGLDNGFAMFHKVRVPRQSLLNRMGr 

VTPEGTYVSPFKOVRQRFGASLGSLSSGRVSIVSL 

AILNLKLAVAIALRFSATRRQFGPTEEEEIPVLEY 

PMQQWRLLPYLAAVYALDHFSKSLFLDLVELQR 

GLASGDRSARQAELGREIHALASASKPLASWTF 

QQGIQECREACGGHGYLAMNRLGVLRDDNDPN 

CTYEGDNNILLQQTSNYLLGLLAHQVHDGACFR 

LAAYKWLVCYLLRETYQKLNQEKRSGSSDFEAR 

NKCQVSHGRPLALAFVELTWQRFHEHVHQPSV 

PPSLRAVLGRLSALYALWStSRHAALLYRGGYF 

SGEQAGEVLESAVLALCSQLKDDAVALVDVIAP 

PDFVLDSPIGRADGELYKNLWGAYLQESKVLER 

ASWWPEFSVNKPVIGSLKSBCL 


3809 


A 


117 

■ 


830 


CFGIMERVGCTLTTTYAHPRPTPTNFLPAISTMAS 

<sVRmiFPH<?MT TM<5T PWPP^TVYK'V AQ>J<sP<l\/ 

D I INX^iVrr iriolNJ_< 1. riOijOI-(i VVI\^ X I V Ai^lN Or O V 

APYCTRSQRVSENTMLPFVSNRTTFFTRYTPDDW 

YRSNLTNYQESNTSRHNSEKLRVDTSRLIQDKYQ 

QTRKTQADTTQNLGERVNDIGFWKSEIIHELDEM 

IGETNALTO\nKKRLERALMETEAPLQVARECLF 

HREBCRMGIDLVHDEVEAQLLTVNVGEMHQSQA 

A 


3810 


A 


3 


518 


VIQELEGGSGADLGEHSCRPASQPRFPRPAEARS 

HPATRRPASGPAMGKTNSKLAPEVLEDLVQNTE 

FSEQELKQWYKGFLKDCPSGILNLEEFQQLYIKF 

FPYGDASKFAQHAFRTFDKNGDGTIDFREFICAL 

SVTSRGSFEQKLNWAFEMYDLDGDGRITRLEML 

EIIE 


3811 


A 


81 


1147 


GCGYGCSGAGGAAIGEPMAKWGEGDPRWIVEE 

RADATNVNNWHWTTBRDASNWSTDKLKTLFLAV 

QVQNEEGKCEVTEVSKLDGEASINNRKGKLIFFY 

EWSVKLNWTGTSKSGVQYKGHVEIPNLSDENSV 

DEVEISVSLAKDEPDTNLVALMKEEGVKLLREA 

MGIYISTLKTEFTQGMILPTMNGESVDPVGQPAL 

KTEERKAKPAPSKTQARPVGVKIPTCKITLKETFL 

TSPEELYRVFTTQELVQAFTHAPATLEADRGGKF 

HMVIXjNVSGEFTDLWEKHIVMKWRFKSWPEG 

HFATITLTFIDKNGETELCMEGRGIPAPEEERTRQ 

GWQRYYFEGIKQTFGYGARLF 


3812 


A 


20 


558 


PCGTAASTHAYDRRAKCRQQQQQQQNGGQNKV 
RPAKKKTSPAREVSSESGTSGQFTPPSSTSVPTIAS 
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SEQID 
NO; 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A»Alanine OCysteine, S^Aspartic Acid, 
E=Glutamic Acid, F»Plieny lata nine, G=Glycine, H=Histidine, 
I«Isoleucine, K»Lysine, l^Leucine, M=Metliionine, 
N=Asparagine, ps=Proline, Q^GIutRininc, R-Arginine, S^Serine, 
T^Thrconlne, VaValine, W-Tryptophan, Y^Tyrosine^ 
X=Unknown, *=Stop codon, /-possible nucleotide deletion, 
\-possible nucleotide insertion 










SSAPVSIWSPASISPLSDPLSTSSSCMQRSYPMTYT 
QASGYSQGYAGSTSYFGGMDCGSYLTPMHHQL 
PGPGATLSPMGTNAVTSHLNQSPASLSTQGYGAS 
KLWGFNFNH 


3813 


A 


1 


1016 


CTEPPRRSTRTPAALASLRPYTDYVWSDQILQES 

EDFFTLIESHEGKPLKLMVYNSKSDSCREVTVTP 

NAAWGGEGSLGCGIGYGYLHRIPTQPPSYHKKPP. 

SDYMEALLQAPGSSMEDPLPGPGSPSHSAPDPDG 

LPHFMETPLQPPPPVQRVMDPGFLDVSGISLLDN 

SNASVWPSLPSSTELTTTAVSTSGPEDICSSSSSHE 

RGGEATWSGSEFEVSFLDSPGAQAQADIfLPQLT 

LPDSLTSAASPEDGLSAELLEAQARFRPASTEGLD 

TGTE AEGLDSQAQISTTE*HPGL* QGP 


3814 


A 


2 


884 


VFWQVRNAGSSPLSAACPLFRTPAPQPCGSWGR 
CCIPHASTGCRPMAERGELDLTGAKQNTGVWLV 

IfVPK'VT QOn W A V A QOP ni7 VnVT PTAVTTkOPTC 
V rN, I LtOKlK^ W /\lV/\oOrvVJJD V w JSlJUAJv 1 l^VJ JV 1 Ci 

VSFTLNEDLANIHDIGGKPASVSAPREHPFVLQSV 

GGQTLTVFTESSSDKLSLEGIVVQRAECRPAASE 

NYMRLKRLQIEBSSKPVRLSQQLDKVVTTNYKP 

FSAFEKHQYYNLKDLVDITKQPWYLKEILKJEIG 
VQNVKGIHKNTWELKPEYRHYQGEEKSD 


3815 


A 


17 


411 


NIGDWEDIGKSPERIIQYYGPATVAQDGSRGYCT 
PIYMLNHIIRLQAVLEIIMNERAmLDLLAQQTTK 
MRNANYQNRLALDYLLAHEGGV*GKFSLTNCC 
LEIDDNGKAIMEITARMRKLAHIPVQrWER 


3816 


A 


3 


1172 


SHWQRRDRRCVRNMAERGRKRPCGPGEHGQRI 

E\mLWQQKKEEKKKWKDLKLMKKLERQRAQ 

EEQAKRLEEEEAAAEKEDRGRPYTLSVALPGSIL 

DNAQSPELRTYLAGQIARACAIFCVDEIVVFDEE 

GQDAKTVEGEFTGVGKKGQACVQLARILQYLEC 
povi RK" APPPK'MnnT OP Ant T MPT c^puT-TN/fP r\r\ 

EESEFREGVVVDRPTRPGHGSFVNCGMKKEVKl 

DKNLEPGLRVTVRLNQQQHPDCKTYHGKWSS 

QDPRTKAGLYWGYTVRLASCLSAVFAEAPFQDG 

YDLTIGTSERGSDVASAQLPNFRHALVVFGGLQG 

LEAGADADPNLEVAEPSVLFDLYVNTCPGQGSR 

TIRTEEAILISLAALQPGLIQAGARHT 


3817 


A 


246 


1197 


FLSAGMSNFTHYAYLLMIESLMLGKVPPHVPSH 

HFIFHDDGSARQKGESDYKVIIQQWFSKSGPWTT 

SSNVTWGLLELQQSISESAVLTIPPGDSGAGSNLI 

KTEQPGEPLEHVYVTEKHAVALESRHQKGELQC 

LIKMCIPLSKPLQMFFSPPHWEAWLQRVQQLAK 

NTRYFRQRLQEMGFIIYGNENASWPLLLYMPG 

KVAAFARHMLEKKIGVVWGFPATPLAEARARF 

CVSAAHTREMLDTVLEALDEMGDLLQLKYSRH 

KKSARPELYDETSFELED 


3818 


A 


215 


789 


MPQSSSSEGSSEIFQVNGHNRLLVQRSEVTQAPG 

QYTVD VEGHGCTFIQATLK YNVLLPKKA SGFSLS 

LEIVKNYSSTAFDLTVTLKYTGIRNKSSMVVIDV 

KMLSGFTPTMSSffiELENXGQVMKTEVKNDHVL 

FYLENVFGRADSFTFSVEQSNLVFNIQPAPGMVY 

DYYEKEEYALAFYHINSSSVSE 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Aianine C=Cysteinc, D=Aspartic Acid, 
E=Glutamic Add, F-Phenylalanine, G=Clyciue, H^HisHdine, 
I=Isoleucine, K=Lysine, L?»Leucine, M=Methionine, 
N°Asparagine> P^Proltne, Q=Glutainine, R-Arginine, S^Scrine, 
T^Threonine, V=Valine, W=^ryptophan, Y=1^rosine, 
X=lInknown, *=Stop codon, /=possible nucleotide deletion, 
\»possiblc nucleotide insertion 


3819 


A 


1 


1483 


RIPDSIISRGVQGLPRDTASLSTTPSESPRAQATSR 

LSTASCPTPKVQSRCSSKENILRASHSAVDITKVA 

RRHRMSPFPLTSMDKAFITVLEMTPVLGTEIINYR 

DGMGRVLAQDVYAKDNLPPFPASVKDGYAVRA 

ADGPGDRFIIGESQAGEQPTQTVMPGQVMRVTT 

GAPIPCGADAWQVEDTELIRESDDGTEELEVRIL 

VQARPGQDIRPIGHDIKRGECVLAKGTHMGPSEI 

GLLATVGVTEVEVNKFPVVAVMSTGNELLNPED 

PDDLLNALNEGISRADVIITSGGVSMGEKDYLKQ 

VLDIDLHAQIHFGRVFMKPGLPTTFATLDIDGVR 

KIIFALPGNPVSAVVTCNLFWPALRKMQGILDP 

RPTIIKARLSCDVKLDPRPEYHRCILTWHHQEPLP 

WAQSTGNQMSSRLMSMRSANGLLMLPPKTEQY 

VELHKGEVVDVMVIGRL 


3820 


A 


2216 


487 


PQEPALKSEFSQVASNTIPLPLPQPNTCKDNGPCK 

QVCSTVGGSAICSCFPGYAIMADGVSCEDQDECL 

MGAHDCSRRQFCVNTLGSFYCVNHTVLCADGyi 

LNAHRKCVDINECVTDLHTCSRGEHCVNTLGSF 

HCYKALTCEPGYALKDGECEDVDECAMGTHTC 

QPGFLCQNTKGSFYCQARQRCMDGFLQDPEGNC 

VDINECTSLSEPCRPGFSCINTVGSYTCQRNPLIC 

ARGYHASDDGTKCVDVNECETGVHRCGEGQVC 

HNLPGSYRCDCKAGFQRDAFGRGCEDVNECWAS 

PGRLCQHTCENTLGSYRCSCASGFLLAADGKRC 

EDVNECEAQRCSQECANIYGSYQCYCRQGYQLA 

tJJvjrrl 1 U 1 IJJLlJrLi^Al^OAVJiJLv^ 1 rxvULN Vrljo Y l^LA 

CPEQGYTMTANGRSCKDVDECALGTHNCSEAET 

CHNIQGSFRCLRFECPPNYVQVSKTKCERTTCHD 

FLECQNSPARITHYQLNFQTGLLVPAHIFRIGPAP 

AFTGDTIALNIIKGNEEGYFGTRELNAYTGVVYL 

QRAVLEPRDFALDVEMKLWRQGSVTTFLAKMHl 

FFriFAL 


3821 


A 


2216 


487 


PQEPALKSEFSQVASNTPLPLPQPNTCKDNGPCK 

QVCSTVGGSAICSCFPGYAIMADGVSCEDQDECL 

MGAHDCSRRQFCVNTLGSFYCVNHTVLCADGYI 

LNAHRKCVDINECVTDLHTCSRGEHCVNTLGSF 

HCYKALTCEPGYALKDGECEDVDECAMGTHTC 

QPGFLCQNTKGSFYCQARQRCMDGFLQDPEGNC 

VDINECTSLSEPCRPGFSCINTVGSYTCQRNPLIC 

ARGYHASDDGTKCVDVNECETGVHRCGEGQVC 

HNLPGSYRCDCKAGFQRDAFGRGCIDVNECWAS - 

PGRLCQHTCENTLGSYRCSCASGFLLAADGKRC 

EDVNECEAQRCSQECANIYGSYQCYCRQGYQLA 

CPEQGYTMTANGRSCKDVDECALGTHNCSEAET 

CHNIQGSFRCLRFECPPNYVQVSKTKCERTTCHD 

FLECQNSPARITHYQtNFQTGLLVPAHIFRIGPAP 

AFTGDTIALNIlKGNEEGYFGTREtLNAYTGWYL 

QRAVLEPRDFALDVEMKLWRQGSVTTFLAKMHI 

FFTTFAL 


3822 


A 


2502 


1540 


MAAATRGCRPWGSLLGLLGLVSAAAAAWDLAS 
LRCTLGAFCECDFRPDLPGLECDLAQHLAGQHL 
AKALVVKALKAFVRDPAPTKPLVLSLHGWTGTG 
KSYVSSLLAHYLFQGGLRSPRVHHFSPVLHFPHP 
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SEQlD 
NO: 


Method 


Predicted 

beginning 

nacleodde 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, D^Aspartic Acid, 
£°Glut8mic Acid, F<=Pheaylalanine, G^lycine, H^Histidine, 
^Jbolendnc, K^Lysine, LF>Leucine» M«IVIethionine, 
N»Asparagine, F»Proline, Q»GIutamloe, R«Argiuine, S^Serine^ 
T=Thrconlne, V=«Vallne, W=Tryptophan, Y=Tyrosine, 
X»Unknown, *=Stop codon, /possible nucleotide deletion, 
\=possible nucleotide insertion 








• 


SHIERYKKDLKSWVQGNLTACGRSLFLFDEMDK 

MPPGLMEVLRPFLGSSWWYGTNYRKAIFIFISN 

TGGEQINQVALEAWRSRRDREEILLQELEPVISR 

AVLDNPHHGFSNSGIMEERLLDAVVPFLPLQRHH 

VRHCVLNELAQLGLEPRDEVVQAVLDSTTFFPE 

DEQLFSSNGCKTVASRIAFFL 


3823 


A 


1 


3174 


YGCEKTTEGRIPLKNIYRLFSADRKRVETALEAC 

SLPSSRNDSIPQEDFTPEVYRVFLNNLCPRPEIDNI 

FSEFGAKSKPYLTVDQMMDFINLKQRDPRLNEIL 

YPPLKQEQVQVLIEKYEPNNSLARKGQISVDGFM 

RYLSGEENGVVSPEKLDLNEDMSQPLSHYFINSS 

HNTYLTAGQLAGNSSVEMYRQVLLSGCRCVELD 

CWKGRTAEEEPVITHGFTMTTEISFKEVIEAIAEC 

AFKTSPFPELLSFENHVDSPKQQAKMAEYCRLIFG 

DALLMEPLEKYPLESGVPLPSPMDLMYKILVKN 

KKKSHKSSEGSGKKKLSEQASNTYSDSSSMFEPS 

SPGAGEADTESDDDDDDDDCKKSSMDEGTAGSE 

ANUTEEMSNLWYIQPVKFESFEISKKR24KSFEM 

SSFVETKGLEQLTKSPVEFVEYNKMQLSRIYPKG 

TRVDSSNYMPQLFWNAGCQMVALNFQTMDLA 

MQINMGMYEYNGKSGYRLKPEFMRRPDKHFDP 

FTEGJVDGIVANTLSVKJISGQFLSDKKVGTYVEV 

DMFGLPVDTRRKAFKTKTSQGNAVNPVWEEEPI 

VFKKWLPTLACLRIAVYEEGGKFIGHRILPVQAI 

RPGYHYICLRNERNQPLTLPAVFVYIEVKDYVPD 

TYADVIEALSNPIRYVNLMEQRAKQLAALTLEDE 

EEVKKEADPGETPSEAPSEARTTPAENGVNHTTT 

LTPKPPSQALHSQPAPGSVKAPAKTEDLIQSVLTE 

VEAQTffiELKQQKSFVKLQKKHYKEMKDLVKR 

HHKKTTDLIKEHTTKYNEIQNDYIJIRRAALEKS 

AKKDSKKKSEPSSPDHGSSTIEQDLAALDAEMTQ 

KLIDLKDKOOOOLLNLROEOYYSEKYOKREHIK 

LLIQKLTDVAEECQNNQLKKLKEICEKEKKELKK 

KMDKKRQEKITEAKSKDKSQMEEEKTEMIRSYI 

QEWQYIKRLEEAQSKRQEKLVEKHKEIRQQILD 

EKPKLQVELEQEYQDKFKRLPLEILEFVQEAMKG 

KISEDSNHGSAPLSLSSDPGKVNHKTPSSEELGGD 

IPGKEFDTPL 


3824 


A 


1 


426 


ILHWFVHRWSGRNNREKIGVHVGFEEILNMEPY 

CCRETLKSLRPECFIYDLSAVVMHHGKGFGSGH 

YTAYCYNSEGGFWVHCNDSKLSMCTMDEVCKA 

QAYILFYTQRVTENGHSKLLPPELLLGSQHPNED 

ADTSSNEILS 


3825 


A 


3 


364 


GIRAKFPNKJPVVVERYPRETFLPPLDKTKFLVPQ 
ELTMTQFLSIIRSRMVLRATEAFYLLVNNKSLVS 
MSATMAEIYRDYKDEDGFVYMTYASQETFGCLE 
SAAPRDGSSLEDRPLHPL 


3826 


A 


1 


1237 


PEKKFERECREAEKAQQSYERLDNDTNATKADV 

EKAKQQLNLRTHMADENICNEYAAQLQNFNGEQ 

HKHFYWIPQIYKQLQEMDERRTIKLSECYRGFA 

DSERKVIPIISKCLEGMILAAKSVDERRDSQMW 

DSFKSGFEPPGDFPFEDYSQHIYRTISDGTISASKQ 

ESGKl^AKTTVGKAKGKLWLFGKKPKGPALED 

FSHLPPEQRRKKLQQRIDELNRELQKESDQKDAL 

>nCMKDVYEKl^QMGDPGSLQPKIj\ETMN^ 
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S£QID 
NO: 


Method 


Predicted 
beginning 
nucleotide 
location * 
corresponding 
to flrst amino 
acid residue of 
pepnae 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
seijnence 


Amino acid sequence (A=AIanine C=Cystelne, D=>A5partic Add, 
E=Glutamic Acid, F=Phenylalanine, G^GIycine, H=Histidine, 
I*=lsoleucine, K=>Lysine, LF*Leucine, M^Mettiionine, 
N»Asparagine, P»Proline, Q=Glutaminer R=Argintne, S»Serine, 
T=>Threonine, V^Valine, W=Tryptophan, Y=Tyroslne, 
X=Unknown, *=Stop codon» /^possible nucleotide deletion, 
\F^possible nucleotide insertion 










LRMEIHKNEAWLSEVEGKTGGRGDRRHSSDINH 
LVTQGRESPEGSYTDDANQEVRGPPQQHGHHNE 
FDDEFEDDDPLPAIGHCKAIYPFDGHNEGTLAMK 
EGEVLYIIEEDKGDGWTRARRQNGEEGYVPTSYI 
DVTLEKNSKGS 


3827 


A 


2 


1584 


INPVSSAVNGEAHSSHETRGQNSNALPSVLLELL 

SQSCLIPAMSSYLRNDSVLDMARHVPLYRALLEL 

LRAIASCAAMVPLLLPLSTENGEEEEEQSECQTS 

VGTLLAKMKTCVDTYTNRLRSKjRENVKTGVKP 

DASDQEPEGLTLLVPDIQKTAEIVYAATTSLRQA 

NQEKKLGEYSKKAAMKPKPLSVLKSLEEKYVAV 

MKKLQFDTFEMVSEDEDGKLGFKVNYHYMSQV 

KNANDANSAARARRLAQEAVTLSTSLPLSSSSSV 

FVRCDEERLDIMKVLITGPADTPYANGCFEFDVY 

FPODYPSSPPLVNLETTGGHSVRFNPNLYNDGKV 

CLSILNTWHGRPEEKWNPQTSSFLQVLVSVQSLI 

LVAEPYFNEPGYERSRGTPSGTQSSREYDGNIRQ 

ATVKWAMLEQIRNPSPCFKEVIHKHFYLKRVEIM 

AQCEEWIADIQQYSSDKRVGRTMSHHAAALKRH 

TAQLREELLKLPCPEGLDPDTDDAPEVCRATTGA 

EETLMHDOVKPSSSKELPSDFOL 


3828 


A 


1415 


845 


PRVPATLVSLDPWHCFPTAGRLAGSTWPPACT 

LQLGPSSEHELDNHRAPLLSLPSQESLSFTPWYLV 

ACKPLFHIFCPLFACFMQEGKVQYLFLHLSHMRL 

LNYYFFPFLAPESLMQALEDLDYLAALDNDGNL 

SEFGIIMSEFPLDPOLSKSILASCEFDCVDEVLTIA 

AMVTGILNDYSFSFFANLH 


3829 


A 


199 


683 


VDHTPVLSKPQCFSSVKWGATLSARSQKTSGIGR 
LMVHVIEATELKACKPNGKSNPYCEISMGSQSYT 
TRTIQDTLNPKWNFNCQFFIKDLYQDVLCLTLFD 
RDQFSPDDFLGRTEIPVAKIRTEQESKGPMTRRLL 
LHEVPTGEVWVRFDLQLFEQKTLL 


3830 


A 


1747 


404 


RKMMEESGIETTPPGTPPPNPAGLAATAMSSTPV 

PLAATSSFSSPNVSSMESFPPLAYSTPQPPLPPVRP 

SAPLPFVPPPAVPSVPPLVTSMPPPVSPSTAAAFG 

NPPVSHFPPSTSAPNTLLPAPPSGPPISGFSVGSTY 

DITRGHAGRAPQTPLMPSFSAPSGTGLLPTPITQQ 

ASLTSLAQGTGTTSAITFPEEQEDPRITRGQDEAS 

AGGlWGFIKGVAGNPIVfVKSVLDKTiCHSVESMIT 

TLDPGMAPYIKSGGELDIVVTSNKEVKVAAVRD 

AFQEVFGLAVWGEAGQSNIAPQPVGYAAGLKG 

AQERIDSLRRTGVIHEKQTAVSVENFIAELLPDK 

WFDIGCLWEDPVHGIHLETFTQATPVPLEFVQQ 

AQSLTPQDYNLRWSGLLVTVGEVLEKSLLNVSR 

TDWHlVLy^TGMSRRQMIYSAARAIAGMYKQRLP 

PRTV 


3831 


A 


5 


674 


FWTRSAWHEGLOOMKANDPSLOEVNLYNIKNIP 

IPTLREFAKALETNTHVKKFSLAATRSNDPVAIAF 

ADMLKVNTTLTSLNffiSHFITGTGILALVEALKEN 

DTLTEKIDNQRQQLGTAVEMEIAQMLEENSRIL 

KFGYQFTKQGPRTRVAAAITKNNDLAWQKDTQ 

EQTSIWQWSQSIAGFNPQFEVQGQNARSWMEE 

LGKAFHQFVRRELKQTEGKLP 


3832 


A 


164 


782 


EPWVPMDVAESPERDPHSPEDEEQPQGLSDDDIL 
RDSGSDQDLDGAGVRASDLEDEESAARGPSQEE 
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SEQH) 
NO: 


Method 


Predicted 

beginning 

nucleoHde 

locatioii 

corresponding 

to first amino 

acid residue of 

Vk^V 

pepuoc 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, F=Pbenyl alanine, G'^Glycine, H=Histidlne, 
I=Isoleucine, K-Lysine, L^'Leudne, M^Methionine, 
N=>Asparaglne, P^ProllDC, Q^Glutamine, R»Arginine, S»Serine, 
T^Threonine, V=VaIine, W=Tryptophan, Y^iyrosine, 
X='Unlinown, *^top codon, /-possible nucleotide deletion, 
\spossib1e nucleotide insertton 










EDNHSDEEDRASEPKSQDQDSEVNELSRGPTSSP 

CEEEGDEGEEDRTSDLRDEASSVTRELDEHELDY 

DEEVPEEPAPAVQEDEAEKAGAEDDEEKGEGTP 

REEGKAGVQSVGEKESLEAAKEKKKEDDDGEID 

DEEMY 


3833 


A 


122 


1676 


SQPPHFTQKMNENKDTDSKKSEEYEDDFEKDLE 

WLINENEKSDASIIEMACEKEENINQDLKENETV 

MEHTKRHSDPDKSLQDEVSPRRNDIISVPGIQPLD 

PISDSDSENSFQESKLESQKDLEEEEDEEVRRYIM 

EKIVQANKLLQNQEPVM)KRERKLKFKDQLVDL 

EVPPLEDTTTSKKYFENERNMFGKLSQLCISNDF 

GQEDVLLSLTNGSCEENKDRTILVERDGKFELLN 

LQDIASQGFLPPINNANSTENDPQQLLPRSSNSSV 

SGTKKEDSTAKIHAVTHSSTGEPLAYIAQPPLNR 

KTCPSSAVNSDRSKGNGKSKHRTOSAHISPVTST 

YCLSPRQKELQKQLEEICREKLKREEERRKIEEEK 

EKKRENDIVFKAWLOKXREOVLEMRRIORAKEI 

EDMNSROENRDPOOAFRLWLKKKHEEOMKERO 

TEELRKQEECLFPLKGTEGRERAFKQWLRRKRM 

EKMAEQQAVRERTRQLRLEAKRSKQLQHHLYM 

SEAKPFRFTDHYN 


3834 


A 


575 


774 


RSRTEELSNSGILKAMSKDLVTFGDVAVNFSQEE 
WEWLNPAQRNLYRKVMLENYRSLVSLGKDMSP 


3835 


A 


2 


100 


ASDFYLRYYVGHKGKFGHEFLEFEFRPDGVYV 




A 


91 


749 


RPTPGHGDFWMOPLTICDAGMST A^iALOV 

RGEALSEEEIWSLLFLAAEQLLEDLRNDSSDYVV 

CPWSALLSAAGSLSFQGRVSHIEAAPFKAPELLQ 

GQSEDEQPDASQMHVYSLGMTLYWSAGFHVPP 

HQPLQLCEPLHSILLTMCEDQPHRRCTLQSVLEA 

CRVHEKEVSVYPAPAGLHIRRLVGLVLGTISEVS 

REPCFSSSSCWSCVAIKI 


3837 


A 


3 


1214 


SLGCTNSARGKGQDDEVRTLMANGAPFTTDWFS 

KLRVSCGYIGDNCKNGADVNAKDMLKMTALH 

WATERHHRDVVELLIKYGADVHAFSKFDKSAFD 

lALEKNNAEILVILQEAMQNQVNVNPERANPVTD 

PVSMAAPFIFTSGEWNLASLISSTNTKTTSGDPH 

ASTVOFSNSTTSVLATLAALAEASVPLSNSHRAT 

ANTEEIIEGNSVDSSIQQVMGSGGQRVITIVTDGV 

PLGNIQTSIPTGGIGHPFIVTVQDGQQVLTVPAGK 

VAEETVIKEEEEEKLPLTKKPRIGEKTNSVEESKE 

GNERELLQQQLQEANRRAQEYRHQLLKKEQEAE 

QYRLKLEAIARQQPNGVDFTMVEEVAEVDAVV 

VTEGELEERETKVTGSAGATGPPTRVSMATVSS 


3838 


A 


1 


1332 


MIEDNKENKDHSLERGRASLIFSLKNEVGGLIKA 

LKIFQEKHVNLLMESRKSKRRNSEFEIFVDCDIN 

REQLM)IFHLLKSHTNVLSVNLPDNFTLKEDGME 

TVPWFPKKISDLDHCANRVLMYGSELDADHPGF 

KDNVYRKRRKYFADLAMNYKHGDPIPKVEFTEE 

EIKTWGTVFQELWKLYPTHACREYLKNLPLLSKY 

CGYREDNIPQLEDVSNFLKERTGFSIRPVAGYLSP 

EIDFLSGLAFRVFHCTQYVRHSSDPFYTPEPDTCH 

ELLGHVPLLAEPSFAQFSQEIGLASLGASEEAVQ 

KLATCYFFTVEFGLCKQDGQLRVFGAGLLSSISE 

LKHALSGHAKVKPFDPKITCKQECLITTFQDVYF 

VSESFEDAKEKMREFTKTIKRPFGVKYNPYTRSI 
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SEQ ID 

NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C^ysteine, D=«Aspartic Add, 
EKSlutamic Acid, F^Pbenyialanine, G^GIyane, H-Histidine, 
I~lsoleuclne, K-Lysine, L^Leucine, M==>Methionine, 
rS=Asparagine, P=Prolinc, Q=Glutamine, R^Arginine, S^erinc, 
T=Thrconine, V=Vallne, W=»Tryptophan, Y-Tyrosine, 
X^Unknown, *^top codon, /=possible nucleotide deletion> 
\spossible nucleotide insertion 










QILKDTKSITSAMNELQHDLDWSDALAKVSRKP 
SI 


3839 


A 


3093 


520 


MS^TVDQIRAIMDKKANmNMSVIAHVDHGKS 

TLTDSLVCKAGIIASARAGETRFTDTRKDEQERCI 

TIKSTAISLFYELSENDLNFIKQSKDGAGFLINLID 

SPGHVDFSSEVTAALRVTDGALVVVDCVSGVCV 

QTETVLRQAIAERIKPVLMMNKMDRALLELQLE 

PEELYQTFQRIVENVNVIISTYGEGESGPMGNIMI 

DPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFA 

AKGEGQLGPAERAKKVEDMMKKLWGDRYFDP 

ANGKFSKSATSPEGKKLPRTFCQLILDPIFKVFDA 

IMNFKKEETAKLIEKLDIKLDSEDKDKEGKPLLK 

AVMRRWLPAGDALLQMITIHLPSPVTAQKYRCE 

LLYEGPPDDEAAMGIKSCDPKGPLMMYISKMVP 

TSDKGRFYAFGRVFSGLVSTGLKVREMGPNYTPG 

KKEDLYLKPIQRTILMMGRYVEPIEDVPCGNrVG 

LVGVDQFLVKTGTITTFEHAHNMRVMKFSVSPV 

VRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCI 

lEESGEHIIAGAGELHLEICLKDLEEDHACIPIKKS 

DPVVSYRETVSEESma.CLSKSPNKHbna.YMKA 

RPFPDGLAEDIDKGEVSARQELKQRARYLAEKY 

EWDVAEARKIWCFGPDGTGPNILTDITKGVQYL 

IffilKDSWAGFQWATKEGALCEENMRGVRFDV 

HDVTLHADAIHRGGGQnPTARRCLYASVLTAQP 

RLMEPIYLVEIQCPEQVVGGIYGVLNRKRGHVFE 

ESQVAGTPMFWKAYLPVNESFGFTADLRSNTG 

GQAFPQCVFDHWQILPGDPFDNSSRPSQVVAETR 

KRKGLKEGIPALDNFLDKL 


3840 


A 


2 


753 


SSTRSRDFCCSEAIQGSLTRRERRASGVRTRRSQG 

SSAMASKILLNVQEEVTCPICLELLTIBPLSLDCGH 

SLCEIACITVSNKEAVTSMGGKSSCPVCGISYSFE 

HLQANQHLANIVERLKEVKLSPDNGKKRDLCDH 

HGEKLLLFCKEDRKVICWLCERSQEHRGHHTVL 

TEEVFKECQEKLQAVLKRLKKEEEEAEKLEADIR 

EEKTSWKYQVQTERQRIQTEFDQLRSILNNEEQR 

ELQRLEEEEKKT 


3841 


A 


2 


405 


GKAFSCFTYLSQHRRTHMAEKPYECKTCKKAFS 

HFGNLKVHERIHTGEKPYECKECRKAFSWLTCL 

LRHERJHTGKKSYECQQCGKAFTRSRFLRGHEKT 

HTGEKMHECKECGKALSSLSSLHRHKRTHWRDT 

L 


'3842 


A 


311 


88 


AVLKNMAPMTALGLLDLHILNLILFLSAGEDFTS 
VVSEIMMYILLVFLTLWLLffiMIYCYRKVSKAEE 
AAQENA 


3843 


A 


3 


1175 


APIRNSRIDDFVRRVESKATSARCGLWGSGPRRR 

PASGMFRGLSSWLGLQQPVAGGGQPNGDAPPEQ 

PSETVAESAEEELQQAGDQELLHQAKDFGNYLF 

NFASAATKKITESVAETAQTIKKSVEEGKIDGIID 

KTIIGDFQKEQKKFVEEQHTKKSEAAVPPWVDT 

NDEETIQQQILALSADKRNFLRDPPAGVQFNFDF 

DQMYPVALVMLQEDELLSKMRFALVPKLVKEE 

VFWRNYFYRVSLIKQSAQLTALAAQQQAAGKEE 

KSNGREQDLPLAEAVRPKTPPVVIKSQLKTQEDE 

EEISTSPGVSEFVSDAFDACNLNQEDLRKEMEQL 

VLDKKQEETAVLEEDSADWEKELQQELQEYEV 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A-Alanine C^Cysteine, D^Aspartic Acid, 
£=Glutamic Acid, F^Phenylalanine, G^lycine, H=Mistidine» 
i»I$oleucine, K=»Lysioe, L»Leucine, M<°Methionine, 
N=Asparagine, F^Froline, Q=Glatainine, RaArginine, S^Serine, 
T^Tbreonine^ V=«ValiDe, W-Tiyptopban, Y=Tyrosine, 
X«»Unl(noivn, ^'^^top codon, /=^ossib!e nucleotide deletion, 
V^possible nucleotide insertion 










VTESEKRDENWDKEIEKMLQEEN 


3844 


A 


798 


148 


LPPAOIPEAWLLLANVWVLILVPLKDRLTDPLLL 

RCKLLPSALQKMALGMFFGFTSVIVAGVLEMER 

LHYIHHNETVSQQIGEVLYNAAPLSIWWQIPQYL 

LIGISEIFASPGLEFAYSEAPRSMQGAIMGIFFCLS 

GVGSLLGSSLVALLSLPGGWLHCPKDFGNINNCR 

MDLYFFLLAGIQAVTALLFVWIAGRYERASQGP 

ASHSRFSRDRG 


3845 


A 


3 


1934 


PEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWI 

MRYTGPMKPIHMEFTNMLQRIGILQTLMSVDDS 

METIYNMLVETGELDNTYIVYTADHGYfflGQFG 

LVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIV 

L>nDLAPTILDIAGLDIPADMDGKSILKLLDTERP 

VNRFHLKKKmVWRDSFLVERGKLLHKRDh^ 

VDAQEENFLPKYQRVKDLCQRAEYQTACEQLG 

QKWQCVEDATGKLKLHKCKGPMRLGGSRALSN 

LVPKYYGQGSEACTCDSGDYKLSLAGRRKKLFK 

KKYKASYVRSRSIRSVAIEVDGRVYHVGLGDAA 

QPRNLTKKHWPGAPEDQDDKDGGDFSGTGGLP 

DYSAANPIKVTHRCYILENDTVQCDLDLYKSLQ 

AWKDHKLHIDHEIETLQKKIKNLREVRGHLKKK 

RPPPPf^PHKT^VHTOWK'nPT l^WPfl^QT HPRPK'Ol 
ixjrcctv^jL/^njvLo X Hi v^njv\jjEvJ~«rkXU\.vjodLyriirr ivrkUi^ 

QEBCDKVWLLREQKRKKKLRKLLKRLQNNDTCS 

MPGLTCFTHDNQHWQTAPFWTLGPFCACTSAN 

NNTYWCMRTENETHNFLFCEFATGFLEYFDLNT 

DPYQLMNAVNTLDRDVLNQLHVQLMELRSCKG 

YKQCNPRTRNMDLGLKDGGSYEQYRQFQRRKW 

PEMKRPSSKSLGQLWEGWEG 


3846 


A 


3 


1934 


PEDSAPQYSRLFPNASQHITPSYNYAPNPDKHWI 

MRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDS 

METIYNMLVETGELDNTYIVYTADHGYHIGQFG 

LVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIV 

LNIDLAPTILDIAGLDIPADMDGKSILKLLDTERP 

VNRFHLKKKMRVWRDSFLVERGKLLHKRDNDK 

VDAQEENFLPKYQRVKDLCQRAEYQTACEQLG 

QKWQCVEDATGKLKLHKCKGPMRLGGSRALSN 

LVPKYYGQG SEACTCDSGD YKLSLAGRRKKLFK 

KKYKASYVRSRSIRSVAIEVDGRVYHVGLGDAA 

QPRNLTKRHWPGAPEDQDDKDGGDFSGTGGLP 

DYSAANPIKVTHRCYILENDTVQCDLDLYKSLQ 

AWKDHKLMDHEffiTLQl^KNLREVRGHLKKK 

RPEECDCHKISYHTOHKGRLKHRGSSLHPFRKGL 

QEKDKVWLLREQKRKKKLRKLLKRLQNNDTCS 

MPGLTCFTHDNQHWQTAPFWTLGPFCACTSAN 

NNTYWCMRTINETHNFLFCEFATGFLEYFDLKT 

DPYQLMNAVNTLDRDVLNQLHVQLMELRSCKG 

YKQCNPRTRNMDLGLKDGGSYEQYRQFQRRKW 

PEMKRPSSKSLGQLWEGWEG 


3847 


A 


1 


1257 


MWSAVLTAFHTGTSNTTFVVYENTYMNITLPPP 

FQHPDLSPLLRYSFETMAPTGLSSLTVNSTAVPTT 

PAAFKSLNLPLQITLSAIMIFILFVSFLGNLVVCLM 

VYQKAAMRSAINILLASLAFADMLLAVLNMPFA 

LVnLTTRWIFGKFFCRVSAMFFWLFVIEGVAILL 

IISIDRFLIIVQRQDKLNPYRAKVLUVSWATSFCV 

AFPLAVGNPDLQIPSRAPQCVFGYTTNPGYQAYV 
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SEQID 
NO: 


Method 

♦ 


Predicted 
beginning 
nucleotide 
location 

corresponding 
to first amino, 
acid residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine OK^ysteine, D»Aspartic Acid, 
E^Giutamlc Acidt F=rlienylaiaiilne« G^^alycine, n=Histidine, 
t^Isoleuclne, K^Lysine, I/-LeudDe» MNViethionine, 
N=A5panigine, Ps=Prolinc, Q=Glutaniinc, R»Arginine, S=>Serine, 
T=Threoninc, V^Valine, W=Tryptophan, Y=Tyroslnc, 
X=Unlcnown, ^'^top codon, /-possible nucleotide deletion, 
V^possible nucleotide insertion 










ILISLISFFIPFLVILYSFMGILNTLRHNALRIHSYPE 

GICLSQASKLGLMGLQRPFQMSIDMGFKTRAFTT 

ILILFAVFIVCWAPFTTYSLVATFSKHFYYQHNFF 

EISTWLLWLCYLKSALWLIYYWRIKKFHDACLD 

1VD!4PKSFKFLPQIJ>GHTKRRIRPSAVYVCGEHRT 

VV 


3848 


A 


3 


2827 


SSAVAARRRRSWASLVLAFLGVCLGITLAVDRS 

NFKTCEESSFCBCRQRSIRPGLSPYRALLDSLQLGP 

DSLTVHLIHEVTKVLLVLELQGLQKNMTRFRIDE 

LEPRRPRYRVPDVLVADPPIARLSVSGRDENSVE 

LTMAEGPYKIILTARPFRLDLLEDRSLLLSVNARG 

LLEFEHQRAPRVSQGSKDPAEGDGAQPEETPRD 

GDKPEETQGKAEKDEPGAWEETFKTHSDSKPYG 

PMSVGLDFSLPGMEHVYGIPEHADNLRLKVTEG 

GEPYRLYNLDVFQYELYNPMALYGSVPVLLAHN 

PHRDLGIFWLNAAETWVDISSNTAGKTLFGKMM 

DYLQGSGETPQTDVRWMSETGIIDVFLLLGPSISD 

VFRQYASLTGTQALPPLFSLGYHQSRWNYRDEA 

DVLEVDQGFDDHNLPCDVIWLDIEHADGKRYFT 

WDPSRFPQPRTMLERLASKRRKLVAIVDPHIKVD 

SGYRVHEELRNLGLYVKTRDGSDYEGWCWPGS 

AGYPDFTNPTMRAWWANMFSYDNYEGSAPNLF 

VWNDMNEPSVFNGPEVTMLKDAQHYGGWEHR 

DVHNIYGLYVHMATADGLRQRSGGMERPFVLA 

RAFFAGSQRFGAVWTGDNTAEWDHLKISIPMCL 

SLGLVGLSFCGADVGGFFKNPEPELLVRWYQMG 

AYQPFFRAHAHLDTGRREPWLLPSQHNDIIRDAL 

GQRYSLLPFWYTLLYQAHREGIPVMRPLWVQYP 

QDVTTFNIDDQYLLGDALLVHPVSDSGAHGVQV 

YLPGQGEVWYDIQSYQKHHGPQTLYLPVTLSSIP 

WQRGGTIVPRWMRVRRSSECMKDDPITLFVALS 

PQGTAQGELFLDDGHTFNYQTRQEFLLRRFSFSG 

NTLVSSSADPEGHFETPIWIERWIIGAGKPAAW 

LQTKGSPESRLSFQHDPETSVLVLRKPGINVASD 

WSIHLR 


3849 


A 


1 


1717 


RARNARGCWGVCRSGFSSAVCGAARMEQVAEG 

ARVTAVPVSAADSTEELAEVEEGVGVVGEDNDA 

AARGAEAFGDSEEDGEDVFEVEKILDMKTEGGK 

VLYKVRWKGYTSDDDTWEPEIHLEDCKEVLLEF 

RKKIAENKAKAVRKDIQRLSLNNDIFEANSDSDQ 

QSETKEDTSPKKKKKKLRQREEKSPDDLKJKJO0\ 

KAGKLKDKSKPDLESSLESLVFDLRTKKRISEAK 

EELKESKKPKKDEVKETKELKKVKKGEIRDLKT 

KTREDPKENRKTKKEKFVESQVESESSVLNDSPF 

PEDDSEGLHSDSREEKQNTKSARERAGQDMGLE 

HGFEKPLDSAMSAEEDTDVRGRRKKKTPRKAED 

TRENRKLENKNAFLEKKTVPKKQRNQDRSKSAA 

ELEKLMPVSAQTPKGRRLSGEERGLWSTDSAEE 

DKETKJRNESKXPKKDEVKETKELKKVKKGEIRD 

LKTKTREDPKENRKTKKEKFVESQVESESSVLND 

SPFPEDDSEGLHSDSREEKQNTKSARERAGQDM 

GLEHGFEKPLDSAMSAEEDTDVRGRRKKKTPRK 

AEDTRENOEIKLENKNAFLEKKTVPKKQRNQDRSK 

SAAELEKLMPVSAQTPKGRRLSGEERGLWSTDS 

AEEDKETKRNESKOKKDEVKETKELKKVKKGE 
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SEQID 
NO: 


Method 


Predicted 

beginDing 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucieotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine OCysteine, D>=:Aspartic Acid, 
£=Glutaniic Acid, F=Pbenylalanine, G=^;iycine, R»Histidine, 
l»lsoleuctne» K'^Lysine, LRLendne, M^Methiootne, 
N»Asparagine, P-Prollne, Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=VaIinc, W=Tryptoplian, Y^Tyrosine, 
X^'Unlcnown, *-Stop codon, /=po5sible nucleotide deletion, 
\=possible nucleotide insertion 










IRDLKTKTREDPKENRKTKKEKFVESQVESESSV 
LNDSPFPED/RQ*RATFRQQR£EKSPDDLKKKKA 
KAGKLKDKSKPDLESSLESLVFDLRTKKRISEAK 
EELKESBCKPK 


3850 


A 


1113 


3975 


PAAAAAAAAAAAAAAGRGPSFTPCFSPSLAVEPS 

RRTRLGSDPAQAMAGNVKKSSGAGGGSGSGGS 

GSGGLIGLMKDAFQPHHHHHHHLSPHPPGTVDK 

KMVEKCWKLMDKVVRLCQNPKLALKNSPPYIL 

DLLPDTYQHLRTILSRYEGKMETLGENEYFRVF 

MEl^MKKTKQTISLFKEGKERMYEENSQPRRNL 

TKLSLIFSHMLAELKGIFPSGLFQGDTFRITKADA 

AEFWRKAFGEKTIVPWKSFRQALHEVHPISSGLE 

AMALKSrmLTCNDYISVFEFDIFTRLFQPWSSLL 

RNWNSLAVTHPGY^IAFLTYDEVKARLQKFIHKP 

GSYIFRLSCTRLGQWAIGYVTADGNILQTIPHNKP 

LFQALIDGFREGFYLFPDGRNQNPDLTGLCEPTP 

QDHIKVTQEQYELYCEMGSTFQLCKICAENDKD 

VKIEPCGHLMCTSCLTSWQESEGQGCPFCRCEIK 

GTEPIWDPFDPRGSGSLLRQGAEGAPSPNYDDD 

DDERADDTLFMMKELAGAKVERPPSPFSMAPQA 

SLPPVPPRLDLLPQRVCVPSSASALGTASKAASGS 

LHKDKPLPVPPTLRDLPPPPPPDRPYSVGAESRPQ 

RRPLPCTPGDCPSRDKLPPVPSSRLGDSWLPRPTP 

KVPVSAPSSSDPWTGRELTNRHSLPFSLPSQMEP 

RPDVPRLGSTFSLDTSMSMNSSPLVGPECDHPKI 

KPSSSANAIYSLAARPLPVPKLPPGEQCEGEEDTE 

YMTPSSRPLRPLDTSOSSRACDCDOOIDSCTYEA 

MYNIQSQAPSITESSTFGEGNLAAAHANTGPEES 

ENEDDGYDVPKPPVPAVLARRTLSDISNASSS/FG 

LFVLERDP*PQNVTEGSQVPERPPKPFPRRINSER 

KAGSCQQGSGPAASAATANSPQLSSEIENLMSQG 

YSYQDIQKALVIAQNNIEMAKNILREFVSISSPAH 

VAT 


3851 


A 


2 

4 


2781 


GRVGSMDGAMGPRGLLLCMYLVSLLILQAMPA 

LGSATGRSKSSEKRQAVDTAVDGVFIRSLKVNC 

KVTSRFAHYVVTSQVVNTANEAREVAFDLEIPK 

TAFISDFAVTADGNAFIGDIKDKVTAWKQYRKA 

AISGENAGLVRASGRTMEQFTIHLTVNPQSKVTF 

QLTYEEVLKRNHMQYEIVIKVKPKQLVHHFEIDV 

DIFEPQGISKLDAQASFLPKELAAQTIKKSFSGKK 

GHVLFRPTVSQQQSCPTCSTSLLNGHFKVTYDVS 

RDKICDLLVANNHFAHFFAPQNLTNMNKlsP/VFV 

IDISGSMRGQKVKQTKEALLKILGDMQPGDYFD 

LVLFGTRVQSAVKGSLVQASEANLQAAQDFVRGF 

SLDEATNLNGGLLRGIEILNQVQESLPELSNHASI 

LIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLY 

NLGFGHNVDFNFLEVMSMENNGRAQRIYEDHD 

ATQQLQGFYSQVAKPLLVDVDLQYPQDAVLALT 

QNHHKQYYEGSEIWAGRIADNKQSSFKADVQA 

HGEGQEFSITCLVDEEEMKKLLRERGHMLENHV 

ERLWAYLTIQELLAKRMKVDREVRANLSSQALR 

MSLDYGFVTPLTSMSIRGMADQDGLKPTIDKPSE 

DSPPLEMLGPRRTFVLSALQPSPTHSSSNTQRLPD 

RVTGVDTDPHFIIHVPQKEDTLCFNINEEPGVILS 

LVQDPNTGFSVNGQLIGNKARSPGQHDGTYFGR 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

seauence 


Predicted end 
nucleotide 
location 
cormpooding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C=Cysteinc, D^Aspartic Acid, 
EF=Glutaniic Acid, F^^^Phenylalanine, G=Glycinc H»Hi$tidine, 
I»lsoIeudne, K-Lysine, I^Leucine, M=Methionine, 
N-Asparagine, F^Frolinc, <H»ltttamiiie, R>Argiaiaet 5=^erine^ 
T^^Threonlne, V^Valine, W«Tryptophan, Y«Tyrosine, 
X°Un known, *»Stop codon, /==possible nucleotide ddetion, 
V-possible nucleotide insertion 






/ 




LGIANPATDFQLEVTPQNITLNPGFGGPVFSWRD 

QAVLRQDGVWTINKKRNLVVSVDDGGTF\EVV\ 

LHRVW\KGSS\VHQDFLGLLMCWDKSIGMSSPGR 

KGCWGQ\FFHPIRFLKVS*HPPPGSDPQKAQMPT 

MVVTO^PGLTVTVRGLQKDYSKDPWHGAEVSC 

WF1\HNNGA*I\TDCAYTDYI\VPDIF 


3852 


A 


39 


1735 


TQVAEAGRGEGVVAGAETGRPQSAGMNLELLES 

FGQNYPEEADGTLDCISMALTCTFNRWGTLLAV 

GCNDGRIVIW\DF\LTRGIA*NKFSAHIHPVCSLC 

WSRDGHKLVSASTDNIVSQWDVLSGDCDQRFRF 

PSPILKVQYHPRDQNKVLVCPMKSAPVMLTLSD 

SKHVVLPVDDDSDLNVVASFDRRGEYIYTGNAK 

GKILVLKTDSQDLVASFRVTTGTSNTTAIKSIEFA 

RKGSCFLINTADRnRVYDGREILTCGRDGEPEPM 

QKLQDLVNRTPWKKCCFSGDGEYIVAGSARQH 

ALYIWEKSIGNLVKILHGTRGELLLDVAWHPVRP 

IIASISSGVVSIWAONOVENWSAFAPDFKELDEN 

VEYEERESEFDIEDEDKSEPEQTGADAAEDEEVD 

VTSVDPIAAFCSSDEELEDSKALLYLPIAPEVEDP 

EENPYGPPPDAVQTSLMDEGASSEKKRQSSADG 

SQPPKKKPKTTNIELQGVPNDEVHPLLGVKGDG 

KSKKKQAGRPKGSKGKEKDSPFKPKLYKGDRGL 

PLEGSAKGKVQAELSQPLTAGGAISELL 


3853 


A 


45 


2603 

* 


PLLFTCGREVRARDPEKEGTTVVAGLKVQVQPRF 

LWILCFSMEETQGELTSSCGSKTMANVSLAFRDV 

SIDLSQEEWECLDAVQRDLYKDVMLENYSNLVS 

LDLEYKYITK^^-LSEKNVCKIYLSQLQTGEKSKN 

TIHEDTIFRNGLQCKHEFERQERHQMGCVSQMLI 

QKQISHPLPIPKIHAREKSYECKECRKAFRQQSYLI 

QHLRIHTGERPYKCMECGKAFCRVGDLRVHHTI 

HAGERPYECKECGKAFRLHYHLTEHQRIHSGVK 

PYECKECGKAFSRVRDLRVHQTIHAGERPYECK 

ECGKAFRLHYQLTEHQRIHTGERPYECKVCGKT 

FRVQRHISQHQKIHTGVKPYKCNECGKAFSHGS 

YLVQHQKIHTGEKPYECKECGKSFSFHAELARH 

RRJHTGEKPYECRECGKAFRLQTELTRHHRTHTG 

EKPYECKECGKAFICGYQLTLHLRTHTGEIPYEC 

KECGKTFS SRYHLTQHYRIHTGEKPYICNECGKA 

FRLQGELTRHHRIHTCEKPYECKECGKAFIHSNQ 

FISHQRIHTSESTYICKECGKIFSRRYNLTQHFKIH 

TGEKPYICNECGKAFRFQTELTQHHRIHTGEKPY 

KCTECGKAFIRSTHLTQHHRIHTGEKPYECTECG 

KTFSRHYHLTQHHRGHTGEKPYICNECGNAFICS 

YRLTLHQRIHTGELPYECKECGKTFSRRYHLTQH 

FRLHTGEKPYSCKECGNAFRLQAELTRHHIVHTG 

EKPYKCKECGKAFSVNSELTRHHRIHTGEKPYQC 

KECGKAFIRSDQLTLHQ\KIILVR\NPMHNVKRIR 

WPLENAL*QRICNLROTLFVTEHVGIPFTSCSQFI . 

RNYFVC 


3854 


A 


108 


m 


LQSCWVPGIPWPSVGWLSWLKDLPSCEIHSASLS 

AVLQGPQCSEMLWPKNLTSWDDSSSVSSGISDTI 

DNLSTDDINTSSSISSYANTPASSRKNLDVQTDAE 

KHSQVERNSLWSGDDVKKSDGGSDSGIKMEPGS 

KWRRNPSDVSDESDKSTSGKKNPVISQTGSWRR 

GMTAQVGITMPRTKASAPAGALKTPGTGKRPGL 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue of 
peptide 
sequence 


Amino acid sequence (A^AIanine C-Cysteine, D^^Aspartic Acid, 
£»G]utaniic Acid, F<=PlienyIalanine, C=Clycine, H=Histidine, 
I^Isoleucine, K»Lysine, L=Lcucine, M=Mcthlonine, 
N-Asparagine, P^Proline, Q=Giutamine, R^^Arginine, S^erine, 
T"Threonine, V=Vallne, W»Tryptoplian, Y=Tyrosine, 
X=Unl;nown, *=Stop codon, /^possible nucleotide deletion, 
V^^possible nucleotide insertion 










S\GPGAPTPAAPPQLARMAWAFSLSAASTPAVSP 
STSPSAVEGSPATILPLASSPPPRTTP*LPLSELTV* 
RPQELVRGRGCLGPGAPTPAAPPQLARMAWAFS 
LSAASTPAVSPSTSPSAVEGSPATILPLASSPPPRT 
TP 


3855 


A 


1 


772 


FRGGDGAPGVLKPGNPLPFPLPPLQYPPPSTLSHS 

VGKSSLVLRFVKGQFUEYQESTIGAAFLTQSVCL 

DDTTVKFEIWDTAGQERYHSLAPNIYYRGAQAAI 

VVYDITNQETFARAKTWVKELQRQASP\SIWGL 

AGNKADLANKRMVEYEEAQAYADDNSLLFMET 

SAKTAMNVNDLFL\AIA*EVAKRVNPQNLG\G\A 

AGRSRGVDLHEQS\QQNKSQCCSN 


3856 

• 


A 


2815 


352 


LGLEAAARPRPGGPAAMQDGNFLLSALQPEAGV 

CSLALPSDLQLDRRGAEGPEAERLRAARVQEQV 

RARLLQLGQQPRHNGAAEPEPEAETARGTSRGQ 

YHTLQAGFSSRSQGLSGDKTSGFRPIAKPAYSPA 

SWSSRSAVDLSCSRRLSSAHNGGSAFGAAGYGG 

AQPTPPMPTRPVSFHERGGVGSRADYDTLSLRSL 

RLGPGGLDDRYSLVSEQLEPAATSTYRAFAYER 

QASSSSSRAGGLDWPEATEVSPSRTIRAPAVRTL 

QRFQSSHRSRGVGGAVPGAVLEPVARAPSVRSLS 

LSLADSGHLPDVHGFNSYGSHRTLQRLSSGFDDI 

DLPSAVKYLMASDPNLQVLGAAYIQHKCYSDAA 

AKKQARSLQAVPRLVKLFNHANQEVQRHATGA 

MRNLIYDNADNKLALVEENGIFELLRTLREQDDE 

LRKNVTGILWNLSSSDHLKDRLAKKTPLEVQL'RD 

LGV*APLSGAGGPP\L1QQNASEAEFYNATGFPR 

NLSSASQATRQKMRECHGLVDALVTSINHALDA 

GKCEDKSVENAVCVLRNLSYRLYDEMPPSALQR 

LEGRGRRDLAGAPPGEVYGCFTPQSRRLRELPLA 

RCELNRHTTEAAAGALQNITGGXDPRGPGGLSRL 

ALEQERILWLLDRVRTADHHQLRSLTGLIRNLS 

RNARNKDEMSTKVV\SHLI\EKLPGSVGEKSPPAE 

\nLV\NI\IAWT^GWLASPI/ALARDLLYFDGLRK 

LIFIKKKRDSPDSEKSSRAASSLLAM.WQYNKLH 

RDFRAKGYRKEDFLGP 


3857 


A 


1034 


204 


VAVTLLSQLPSAIQRTAAWEMRAPLTFRVPLALD 
LIKPEHCTVNVDNSLSIPVIAAELVVRKPSEKGM 

MFALPLK*PVTAAFHDSSMPSSLLQIEMEQLFLE 

ARLQ/PDSKSEARRNQCDSMLLRNQQLCSTCQE 

MKMVQPRTMKIPDDPKASFENCMSYRMSLHQP 

KFQTTPEPFHDDIPTENIHLQNL/PILGPRTAVFHG 

LLTEAYKTLKERQRSSLPRKEPIGKTTEAVSGRSS 

SPPRLPERK 


3858 


A 


203 


3469 


SHQEIEQNSAMAPRKRGGRGISFIFCCFRNNDHPE 

ITYRLRNDSNFALOTMEPALPMPPVEELDVMFSE 

LVDELDLTDKHREAMFALPAEKKWQIYCSKKK 

DQEENKGATSWPEFYIDQLNSMAARKSLLALEK 

EEEEERSKTIESLKTALRTKPMRFVTRFIDLDGLS 

CILNFLKTMDYETSESRIHTSLIGCIKALlVfNNSQG 

RAHVLAHSESINVIAQSLSTENIKTKVAVLEILGA 

VCLVPGGHKKVLQAMLHYQKYASERTRPQTLIN 
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1 SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine C^Cysteine, D=Aspartlc Acid, 
EK^Iutamic Acid, F=PhenylaIanine, G==Glycine, H-Histidinc, 
I^Isoleucine, K=Lysine, L<=Leucine, M=Methionine, 
N»Asparagine, F^Proline, Q»Glutamine, R=Arginine, S»Serine, 
T^Thrconine, V^Valine, W^Tryptoplian, Y=Tyrosine, 
X»Unkno>vn, *<=^top codon» /^possible nucleotide deletion, 
\<=q}Ossible nucleotide insertion 










DLDKSTGRYRDEVSLKTAIMSFINAVLSQGAGVE 

SLDFRLHLRYE\FLMLGIHPVMDKLRKHENSTLD 

RHLDFFEMLRNEDELEFAKRFELVHIDTKSATQM 

FELTRKRLTHSEAYPHFMSILHHCLQMPYKRSGN 

TVQYWLLLDRnQQIVIQNDKGQDPDSTPLENFNI 

KNWRMLVNENEVKQWKEQAEKMRKEHNELQ 

QKLEKi:£RECDAKTQEKEEMMQTLNKMKEKLE 

KETTEHKQVKQQVADLTAQLHELSRRAVCASIP 

GGPSPGAPGGPFPSSVPGSLLPPPPPPPLPGGMLPP 

PPPPLPPGGPPPPPGPPPLGAIMPPPGAPMGLALK 

KKSIPQPTNALKSFNWSKLPENKLEGTVWTEIDD 

TKVFKILDLEDLERTFSAYQRQQDFFVNSNSKQK 

EADAIDDTLSSKLKVKELSVIDGRRAQNCNILLS 

RLKi^NDEIKRAILTMDEQEDLPKDMLEQLLKFV 

PEKSDIDLLEEHKHELDRMAKADRFLFEMSRINH 

YQQRLQSLYFKKKFAERVAEVKPKVEAIRSGSEE 

VFRSGALKQLLEVVLAFGNYMNKGQRGNAYGF 

KISSLNKIADTKSSIDKNITLLHYLITIVENKYPSV 

VETELEYQKSQPPQPGDKFVSWSQFITVASFSFS 

DVEDLLAEAKDLFTKAVKHFGEEAGKIQPDEFF 

GIFDQFLQAVSEAKQENENMRKKKEEEERRARM 

EAQLKEQRERERKMRKAKENSEESGEFDDLVSA 

LRSGEVFDKDLSKLKRNRKRITNQMTDSSRERPI 

TKLNF 


3859 


A 


1279 


141 


RVEHLSEFLVDIKPSLTFDVIPLLDPYGPAGSDPS 

LEFLVVSEETYRGGMAINRFRLENDLEELALYQI 

QLLKDLRHTENEEDKVSSSSFRQRMLGNLLRPPY 

ERPELPTCLYVIGLTGISGSGKSSIAQRLKGLGAF 

VIDSDHLGHRAYAPGGPAYQPVVEAFGTDILHK 

REEMDRAVAEGBCRVCVIDAAVLLEAGWQNLVH 

EVWTAVIPETEAVRRIVERDGLSEAAAQSRLQSQ 

MSGQQLVEQSHVVLSnCGSRISPNARWRKPGPS 

CRSAFPRLIRPSTEKFSVGPDWLLELTSDPVVRRN 

GGLDAHPGSGPEVQAILCRTWPGLVDTGSLPNTL 

VFGQH 


3860 


A 


1 


3881 


MGQKSVGASYVQIPLVPPLSRHPKGLGHEDRWS 

SYCLSSLAAQNICTSKLHCPAAPEHTDPSEPRGSV 

SCCSLLRGLS SG WSSPLLPAP VCNPNKAIFT VDA 

KTTEILVANDKACGLLGYSSQDLIGQKLTQFFLR 

SDSDVVEALSEEHMEADGHAAWFGTWDIISRS 

GEKIPVSVWMKJIMRQERRLCCVVVLEPVERVST 

WVAFQSDGTVTSCDSLFAHLHGYVSGEDVAGQ 

HITDLIPSVQLPPSGQHIPKNLKIQRSVGRARDGT 

TFPLSLKLKSQPSSEEATTGEAAPVSGYRASVWV 

FCTISGLITLLPDGTmGINHSFALTLFGYGKTELL 

GKNITFLIPGFYSYMDLAYNSSLQLPDLASCLDV 

GNESGCGERTLDPWOGODPAEGGODPRINVVLA 

GGHVVPRDEIRKLMESQDIFTGTQTELIAGGQLL 

SCLSPQPAPGVDNVPEGSLPVHGEQALPKDQQIT 

ALGREEPVAIESPGQDLLGESRSEPVDVKPFASCE 

DSEAPVPAEDGGSDAGMCGLCQKAQLERMGVS 

GPSGSDLWAGAAVAKPQAKGQLAGGSLLMHCP 

CYGSEWGLWWRSQDLAPSPSGMAGLSFGTPTLD 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

locacioo 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=»Alanine C=Cysteine, D^Aspartic Acid, 
E^GIutamic Acid, F=Phenyla!anmc, G^GIycine, H-Histidine, 
iBlsoleudne, K-Lysine, l^Leudne, A^Metbiooine, 
N-Asparagine, P^ProIinc, Q=Giutaniine, R==Argimne, So^Serine, 
T^^Threoninc, V^Valine, W«Tryptophan, Y=Tyrosine, 
X=Unl(nown, *=Stop codon, A=possib]e nucleotide deletion, 
>^possible nucleotide insertion 










EPWLGVENDREELQTCLIKEQLSQLSLAGALDVP 

HAELVPTECQAVTAPVSSCDLGGRDLCGGCTGS 

SSACYALATDLPGGLEAVEAQEVDVNSFSWNLIC 

ELFFSDQTDQTSSNCSCATSELRETPSSLAVGSDP 

DVGSLQEQGSCVLDDRELLLLTGTCVDLGQGRR 

FRESCVGHDPTEPLEVCLVSSEHYAASDRESPGH 

VPSTLDAGPEDTCPSAEEPRLNVQVTSTPVIVMR 

GAAGLQREIQEGAYSGSCYHRDGLRLSIQFEVRR 

VELQGPTPLFCCWLVKDLLHSQRDSAARTRLFL 

ASLPGSTHSTAAELTGPSLVEVLRARPWFEBPPK 

AVELEGLAACEGEYSQKYSTMSPLGSGAFGFVW 

TAAODKEKI^VWKFIKKEKVLEDCWIEDPKLG 

KVTLEIAILSRVEHANIIKVLDIFENQGFFQLVME 

KHGSGLDLFAFIDRHPRLDEPLASYIFRQVRAG\Q 

SRLVSAVGYLRLKDIIHRDIKDENIVIAEDFTIKLI 

DFGSAAYLERGKLFYTFCGTIEYCAPEVLMGNPY 

RGPELEMWSLGVTLYTLVFEENPFCELEETVEAA 

IHPPYLVSKELMSLVSGLLQPVPERRTTLEKLVT 

DPWVTQPVNLADYTWEEVFRVNKPESGVLSAAS 

LEMGNRSLSDVAQAQELCGGPVPGEAPNGQGCL 

HPGDPRLLTS 


3861 


A 


1 


.3881 

1 


MGQKSVGASYVQIPLVPPLSRHPKGLGHEDRWS 

SYCLSSLAAQNICTSKLHCPAAPEHTDPSEPRGSV 

SCCSLLRGLSSGWSSPLLPAPVCNPNKAIFTVDA 

KTTEILVANDKACGLLGYSSQDLIGQKLTQFFLR 

SDSDVVEALSEEHMEADGHAAVVFGTWDnSRS 

GEKIPVSVWMKRMRQERBLCCVVVLEPVERVST 

WVAFQSDGTVTSCDSLFAHLHGYVSGEDVAGQ 

HITDLIPSVQLPPSGQHIPKNLKIQRSVGRARDGT 

TFPLSLKLKSQPSSEEATTGEAAPVSGYRASVWV 

FCTISGLITLLPDGTIHGINHSFALTLFGYGKTELL 

GKNITFLIPGFYSYMDLAYNSSLQLPDLASCLDV 

GNESGCGERTLDPWQGQDPAEGGQDPRINWLA 

GGHWPRDEIRKLMESQDIFTGTQTELIAGGQLL 

SCLSPQPAPGVDNVPEGSLPVHGEQALPKDQQIT 

ALGREEPVAIESPGQDLLGESRSEPVDVKPFASCE 

DSEAPVPAEDGGSDAGMCGLCQKAQLERMGVS 

GPSGSDLWAGAAVAKPQAKGQLAGGSLLMHCP 

CYGSEWGLWWRSQDLAPSPSGMAGLSFGTPTLD 

EPWLGVENDREELQTCLIKEQLSQLSLAGALDVP 

HAELVPTECQAVTAPVSSCDLGGRDLCGGCTGS 

SSACYALATDLPGGLEAVEAQEVDVNSFSWNLK 

ELFFSDQTDQTSSNCSCATSELRETPSSLAVGSDP 

DVGSLQEQGSCVLDDRELLLLTGTCVDLGQGRR 

FRESCVGHDPTEPLEVCLVSSEHYAASDRESPGH 

VPSTLDAGPEDTCPSAEEPRLNVQVTSTPVIVMR 

GAAGLQREIQEGAYSGSCYHRDGLRLSIQFEVRR 

VELQGPTPLFCCWLVKDLLHSQRDSAARTRLFL 

ASLPGSTHSTAAELTGPSLVEVLRARPWFEEPPK 

AVELEGLAACEGEYSQKYSTMSPLGSGAFGFVW 

TAVDKEKNKEVVVKFIKKEKVLEDCWIEDPKLG 

KVTLEIAILSRVEHANDKVLDIFENQGFFQLVME 

KHGSGLDLFAFIDRHPRLDEPLASYIFRQVRAG\Q 

SRLVSAVGYLRLKDIIHRDIKDENIVIAEDFTIKLI 

DFGSAAYLERGKLFYTFCGTIEYCAPEVLMGNPY 



454 



wo 01/57190 



PCT/USOl/04098 



SEQD> 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of. 
peptide 
sequence 


Amino acid sequence (A^AIanine OCysteine, D=Aspartic Acid, 
E==Glutamic Acid, F^Phenylalanine, G=Glyciae, H»Histidine, 
I=lsoleucine, K^Lysinc, L»Leucine, M=Methioniae, 
NoAsparagine, Ps^Prollne, (XSIataniiiie, R^Afgioine, S^Serine, 
T°Threonine, V^Valinc, W=Tryptophan, Y^Tyrosine, 
X'^'linknown, *-Stop codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion 










RGPELEMWSLGVTLYTLVFEENPFCELEETVEAA 

IHPPYLVSKELMSLVSGLLQPVPERRTTLEKLVT 

DPWVTQPVNLADYTWEEVFRVNKPESGVLSAAS 

LEMGNRSLSDVAQAQELCGGPVPGEAPNGQGCL 

HPGDPRLLTS 


3862 


A 


399 


2069 


TMDRSKRNSIAGFPPRVEXRLEEFEGGGGGEGNV 

SQVGRVWPSSYRALISAFFRLTRLDDFTCEKIGSG 

FFSEVFKVRHRASGQVMALKMNTLSSNRANML 

KEVQLMNRLSHPNILRYINSGNLEQLLDSNLHLP 

WTVRVKLAYDUVGLSYLHFKGIFHRDLTSKNC 

LIKRDENGYSAWADFGLAEKIPDVSMGSEKLA 

VVGSPFWMAPEVLRDEPYNEKADVFSYGIILCEII 

ARIQADPDYLPRTENFGLDYDAFQHMVGDCPPD 

FLQLTFNCCNMDPKLRPSFVEIGKTLEEILSKLQE 

EEQERDRKLQPTARGLLEKAPGVKRLSSLDDKIP 

HKSPCPRRTTWLSRSOSDIFSRKPPRTVSVLDPYY 

RPRDGAARTPKVNPFSARQDLMGGKIKFFDLPSK 

SVISLVFDLDAPGPGTMPLADWQEPLAPPIRRWR 

SLPGSPEFLHQEACPFVGREESLSDGPPPRLSSLK 

YRVKEIPPFRASALPAAQAHEAMDCSILQEENGF 

GSRPQGTSPCPAGASBEMEVEERPAGSTPATFSrS 

GIGLQTQGKQDG 


3863 


A 


399 


2069 


TMDRSKRNSIAGFPPRVENRLEEFEGGGGGEGNV 

SQVGRVWPSSYRALISAFFRLTRLDDFTCEKIGSG 

FFSEWKVRHRASGQVMALBCMNTLSSNRANML 

KEVQLMNRLSHPNILRYINSGNLEQLLDSNLHLP 

WTVRVKLAYDIAVGLSYLHFKGIFHRDLTSKNC 

LIKRDENGYSAWADFGLAEJOPDVSMGSEKLA 

VVGSPFWMAPEVLRDEPYNEKADVFSYGIILCEII 

ARIQADPDYLPRTENFGLDYDAFQHMVGDCPPD 

FLQLTFNCCNMDPKLRPSFVEIGKTLEEILSRLQE 

EEQERDRKLQPTARGLLEKAPGVKRLSSLDDKIP 

HKSPCPRRTIWLSRSOSDIFSRKPPRTVSVLDPYY 

RPRDGAARTPKVNPFSARQDLMGGKIKFFDLPSK 

SVISLVFDLDAPGPGTMPLADWQEPLAPPIRRWR 

SLPGSPEFLHQEACPFVGREESLSDGPPPRLSSLK 

YRVKEIPPFRASALPAAQAHEAMDCSILQEENGF 

GSRPQGTSPCPAGASEEMEVEERPAGSTPATFSTS 

GIGLQTQGKQDG 


3864 


A 


3 


911 


SWNMDSDSCAAAFHPEEYSPSCKRRRTVEDFNK 

FCTFVLAYAGYIPYPKEELPLRSSPSPANSTAGTI 

DSDGWDAGFSDIASSVPLPVSDRCFSHLOPTLLO 

RAKPSNFLLDRXKTDKLKKKKKRKRRDSDAPGK 

EGYRGGLLKLEAADPYVETPTSPTLQDIPQAPSD 

PCSGWDSDTPSSGSCATVSPDQVKEIKTEGKRTI 

VR/QEAQLMARNDGNFSSLLESIFPS\DDDSWDLV 

TCFCMKPFAGRPMBECNECHTWJHLSCAKIRKSN 

VPEVFVCQKCRDSKFDIRRSNRSRTGSRKLFLD 


3865 . 


A 


3 


3573 


QERLRSRSRPDRAAREAGSARGRQPKRTERVEQ 

FLTIARRRGRRSMPVSLEDSGEPTSCPATDAETAS 

EGSVESASETRSGPQSASTAVKERPASSEKVKGG 

DDHDDTSDSDSDGLTLKELQNRLRRKREQEPTE 

RPLKGIQSRLRKKRREEGPAETVGSEASDTVEGV 

LPSKQEPENDQGWSQAGKDDRESKLEGKAAQD 

IKDEEPGDLGKPKPEGEGYDPNALYCICRQPHNN 
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SEQ ID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A-Alanine C=Cysteinc, D-Aspartic Acid, 
E-Glutamic Add, F=Plienylalanine, G=GIycine, H^Histtdine, 
l»lsoleucine, K'^LysinCf LF^Leucine, M^^Methionine, 
N=Asparagiae, P==PFoIine^ Q^lutaaiine» R-Arginine, S^^Serine, 
T=Thrconine, V=Valine, W=Tryptophan, YeTyrosine, 
X<»Unknoivn» *^top codon, A^possible nucleotide deletion, 
\r=pos5ible nucleotide insertion 








* 


RFMICCDRCEEWFHGDCVGISEARGRLLEKNGE 

DYICPNCTILQVQDETHSETADQQEAKWRPGDA 

DGTDCTSIGTIEQKSSEDQGIKGRIEKAANPSGKK " 

KLKIFQPGPGPVPTQLPVLWQVLEIAVSRSISAFT 

LLHCISCKVIEAPGASKCIGPGCCHVAQPDSVYCS 

NDCILKHAAATMKFLSSGKEQKPKPKEKMKMK 

PEKPSLPKCGAQAGIKISSVHKRPAPEKKETTVK 

KAWVPARSEALGKEAACESSTPSWASDHNYNA 

VKPEKTAAPSPSLLYKSTKEDRRSEEKAAATAAS 

KKTAPPGSTVGKQPAPRNLVPKKSSFANVAAAT 

PAIKKPPSGFKGTIPKRPWLSATPSSGASAARQAG 

PAPAAATAASKKFPGSAALVGAVRKPVVPSVPM 

ASPAPGRLGAMSAAPSQPNSQIRQNIRRSLKEIL 

WKTRFLFFILFRVNDSDDLIMTENEVGKIALHEEK 

EMFNLFQVTDN/RAYKSKYRSIMFNLKDPKNQG 

LFHRVLREEISLAKLVRLKPEELVSKELSTWKER 

PARSVMESRTKLHNBSKKTAPRQEAIPDLEDSPP 

VSDSEEQQESARAVPEKSTAPLLDVFSSMLKDTT 

SQHRAHLFDLNCKICTGQVPSAEDEPAPKKQKLS 

ASVKKEDLKSKHDSSAPDPAPDSADEVMPEAVP 

EVASEPGLESASHPNVDRTYFPGPPGDGHPEPSPL 

EDLSPCPASCGSGWTTVTVSGRDPRTAPSSSCT 

AVASAASRPDSTHMVEARODVPKPVLTSVMVPK 

SILAKPSSSPDPRYLSVPPSPNISTSESRSPPEGDTT 
LFLSRLSTIWKGFINMQSVAKFVTKAYPVSGCFD 
YLSEDLPDTIHIGGRIAPKTVWDYVGKLKSSVSK 
ELCLIRFHPATEEEEVAYISLYSYFSSRGRFGVVA 
NNNRHVKDLYLIPLSAQDPVPSKLLPFEGPGKRR 
LSGWR 


3866 


A 


2 


3181 


AQQPVGRRGGASGAGGGRRGTPRPRAGAGPGF 

QVSSGGCRLSKMRRFLRPGHDPVRERLKRDLFQ 

FNKTVEHGFPHQPSALGYSPSLRILAIGTRSGAIK 

LYGAPG VEFMGLHQENNAVTQIHLLPGQCQLVT 

LLDDNSLHLWSLKVKGGASELQEDESFTLRGPP 

GAAPSATQITVVLPHSSCELLYLGTESGNVFVVQ 

LPAFRALEDRTISSDAVLQRLPEEARHRRVFEMV 

EALQEHPRDPNQILIGYSRGLVVIWDLQGSRVLY 

HFLSSQQLENIWWQRDGRLLVSCHSDGSYCQWP 

VSSEAQQPEPLRSLVPYGPFPCKAITRILWLTTRQ 

G\LPFTIFQGGMPRASYGDRHCISVIHDGQQTAFD 

FTSRVIGFTVLTEADPAATFDDPYALVVLAEEEL 

WJDLQTAGWPPVQLPYLASLHCSAITCSHHVSN 

IPLKLWERIIAAGSRQNAHFSTMEWPIDGGTSLTP 

APPQRDLLLTGHEDGTVRFWDASGVCLRLLYKL 

STVRVFLTDTDPNENLSAQGEDEWPPLRKVGSF 

DPYSDDPRLGIQKIFLCKYSGYLAVAGTAGQVLV 

LELNDEAAEQAVEQVEADLLQDQEGYRWKGHE 

RLAARSGPVRFEPGFQPFVLVQCQPPAWTSLAL 

HSEWRLVAFGTSHGFGLFDHQQRRQVFVKCTLH 

PSDQLALEGPLSRVKSLKKSLRQSFRRMRRSRVS 

SRKRHPAGPPGEAQEGSAKAERPGLQNMELAPV 

QRKIEARSAEDSFTGFVRTLYFADTYLKDSSRHC 

PSLWAGTNGGHYAFSLRVPPAERRMDEPVRAE 

QAKEIQLMHRAPWGILVLDGHSVPLPEPLEVAH 

DLSKSPDMQGSHQLLVVSEEQFKVFTLPKVSAK 
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SEQ ID 
NO; 


Method 

• 


Predicted 

beginning 

nucleotide 

location 

correspondiag 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A-Alanine C^KTysteine, I>=Aspartic Acid^ 
E>=Glutamic Acid, F^Pheuylalanine, G^lyctne, H^Histidine, 
Msoieudne, K'^^Lysine, I/»Leucine, M«Methionine, 
N=>A8paragine, P«Proline, Q=Glutamine» R^Arglnlne, S»Serine, 
T=Tbreonine, V=Valine, W=Tryptophan, Y=Tyrt)sine, 
X'=Unl(nown, *=Stop codon, /^possible nucleotide deletion, 
\^ossible nucleotide insertion 










LKLKLTALEGSRVRRVSVAHFGSRRAEDYGEHH 

LAVLTNLGDIQWSLPLLKPQVRYSCIRREDVSGI 

ASCVFTKYGQGFYLISPSEFERFSLSTKG\LVEPRC 

LVDSAETKlvTHRPGNGAGPKKAPSRARNSGTQSD 

GEEKQPGLVMERALLSDERAATGWHIEPPWGA 

ASAMAEQSEWLSVQAAR 


3867 


A 


2 


3181 


AQQPVGRRGGASGAGGGRRGTPRPRAGAGPGF 

QVSSGGCRLSKMRRFLRPGHDPVRERLKRDLFQ 

FNKTVEHGFPHQPSALGYSPSLRILAIGTRSGAIK 

LYGAPGVEFMGLHQENNAVTQIHLLPGQCQLVT 

LLDDNSLHLWSLKVKGGASELQEDESFTLRGPP 

GAAPSATQITVVLPHSSCELLYLGTESGNVFVVQ 

LPAFRALEDRTISSDAVLQRLPEEARHRRVFEMV 

EALQEHPRDPNQILIGYSRGLVVIWDLQGSRVXY 

HFLSSQQLENIWWQRDGRLLVSCHSDGSYCQWP 

VSSEAQQPEPLRSLVPYGPFPCKAITRILWLTTRQ 

G\LPFTIFQGGMPRASYGDRHCISVIHDGQQTAFD 

FTSRVIGFTVLTEADPAATFDDPYALVVLAEEEL 

VVIDLQTAGWPPVQLPYLASLHCSAITCSHHVSN 

IPLKLWERIIAAGSRQNAHFSTMEWPIDGGTSLrP 

APPQRDLLLTGHEDGTVRFWDASGVCLRLLYKL 

STVRVFLTDTDPNENLSAQGEDEWPPLRKVGSF 

DPYSDDPRLGIQKIFLCKYSGYLAVAGTAGQVLV 

LELNDEAAEQAVEQVEADLLQDQEGYRWKGHE 

RLAARSGPVRFEPGFQPFVLVQCQPPAWTSLAL 

HSEWRLVAFGTSHGFGLFDHQQRRQVFVKCTLH 

PSDQLALEGPLSRVKSLKKSLRQSFRRMRRSRVS 

SRKRHPAGPPGEAQEGSAKAERPGLQNMELAPV 

QRKIEARSAEDSFTGFVRTLYFADTYLKDSSRHC 

PSLWAGTNGGTIYAFSLRVPPAERRMDEPVRAE 

QAKEIQLMHRAPVVGILVLDGHSVPLPEPLEVAH 

DLSKSPDMQGSHQLLVVSEEQFKVFTLPKVSAK 

LKLKLTALEGSRVRRVSVAHFGSRRAEDYGEHH 

LAVLTNLGDIQWSLPLLKPQVRYSCIRREDVSGI 

ASCVFTKYGQGFYLISPSEFERFSLSTKG\LVEPRC 

LVDSAETKNHRPGNGAGPKKAPSRARNSGTQSD 

GEEKQPGLVMERALLSDERAATGWHIEPPWGA 

ASAMAEQSEWLSVQAAR 


3868 


A 


1 


2497 


GDSGGPLVCEEPSGRFFLAGIVSWGIGCAEARRP 

GVYARVTRLRDMLEATTKASMPLAPTMAPAPA 

APSTAWPTSPESPVVSTPTKSMQALSTVPLDWVT 

VPKLQECGARPAMEKPTRVVGGFGAASGEVPW 

QVSLKEGSRHFCGATVVGDRWLLSAAHCFNHT 

KVEQVRAHLGTASLLGLGGSPVKIGLRRWLHP 

LYTSTPGILDFDLAVLELASPLAFNKYIQPVCLPLAI 

QKFPVGRKCMISGWGNTQEGNATKPELLQKASV 

GIK)QKTCSVLYNFSLTDRMICAGFLEGKVDSCQ 

VSGIKALYESELADARRVLDETARERARLQIEIG 

KLRAELDEVNKSAKKREGELTVAQGRVKDLESL 

FHRSEVELAAALSDKRGLESDVAELRAQLAKAE 

DGHAVAKKQLEKETLMRVDLENRCQSLQEELDF 

RKSVFEEEVRETRRRHERRLVEVDSSRQQEYDFK 

MAQALEELRSQHDEQVRLYKLELEQTYQAKLDS 

AKLSSDQNDKAASAAREELKEARMRLESLSYQL 

SGLQKQASAAEDRIRELEEAMAGERDKFRKMLD 
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SEQID 
NO: 


Method 


Predicted 

begioning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^Alanine C-Cysteine, D^Aspartic Acid, 
EXSIutamtc Acid, F^Phenylalaaine, G^Glycine, H=Uistidine, 
I«l50ieuciiie« KpLyslac, LrsLeucine, M=>MethioniDe, 
N»Asparagine, P^Proiine, Q-Glutamine, R=Arginine, S^erine, 
T=Thrconine» V=Vallne, W^Tryptophan, Y=Tyrosinc, 
X=Unknown, *=>Stop codon, /^possible nucleotide deletion, 
\»possible nucleotide insertion 










AKEQEMTEMRDVMQQQLAEYQELLDVKLALD 

MEINAYRKLLEGEEERLKLSPSPSSRVTVSRATSS 

SSGSLSATGRLGRSKRKR\WRWRSPW\QRPKRPG 

HGHGWQRWLPPGPAGLGLGQR\HIEEIDLEGKFV 

QLKNNSDKDQSLGNWRIKRQVLEGEEIAYXFTP 

KYILRAGQMVTVWAAGAGVAHSPPSTLVWKGQ 

SSWGTGESFRTVLVNADGEEVAMRTVKKSSVM 

RENENGEEEEEEAEFGEEDLFHQQGDPRTTSRGC 

YVM 


3869 


A 


1 


1942 


RYRAGIPGDGRKDYIRLTKPGLTLPGRAMFARGS 

RRRRSGRAPPEAEDPDRGQPCNSCREQCPGFLLH 

GWRKICQHCKCPREEHAVHAVPVDLERIMCRLIS 

DFQRHSISDDDSGCASEEYAWVPPGLKPEQVYQ 

FFSCLPEDKVPYVNSPGEKYRIKQLLHLQLPPHDS 

EAQYCTAL\EE\EEKKELRAFSQQRKRENLG/RLG 

IVRIFPVXmGARCEECGKQlGGGDIAVFUSRASL 

GLLLGQPSCFWCTTCQELLVDLIYFYHVGKVYC 

GRHHAECLRPRCQACDEIIFSPECTEAEGRHWHM 

DHFCCFECEASLGGQRYVMRQSRPHCCACYEAR 

HAEYCDGCGEHIGLDQGQMAYEGQHWHASDRC 

FCCSRCGRALLGRPFLPRRGLIFCSRACSLGSEPT 

APGPSRRSWSAGPVTAPLAASTASFSAVKGASET 

TTKGTSTELAPATGPEEPSRFLRGAPHRHSNPEL 

GLRSVPEPPPESPGQPNLRPDDSAFGRQSTPRVSF 

RDPLVSEGGPRRTLSAPPAQRRRPRSPPPRAPSRR 

RHHHHNHHHHHNRHPSRRRHYQCDAGSGSDSE 

SCSSSPSSSSSESSEDDGFFLGERIPLPPHLCRPMP 

AQDTAMETFNSPSLSLPRDSRAGMPRQARDECNC 

IVA 


3870 


A 


2 


3485 


FVWRVFYVHASCMPPRARSWEGAHAPVGMHV 

AEAHACSSQQQQMPPAQFWMLEWLLHLCAFLS 

TPSFPHWCCCSNPHGSIADKPEEIVPASKPSRAAE 

MMAVEPRVATIKQRPSSRCFPAGSDMNSVYERQ 

GIAVMTPTVPGSPKAPFLGIPRGTMRRQKSE>SRI 

FLSGITEEERQFLAPPMLKFTRSLSMPDTSEDIPPP 

PQSVPPSPPPPSPTTYNCPKSPTPRVYGTIKPAFNQ 

NSAAKVSPATRSDTVATMMREKGMYFRRELDR 

YSLDSEDLYSRNAGPQANFRNKRGQMPENPYSE 

VGKIASKAVYVPAKPARRKGMLVKQSNVEDSPE 

KTCSIPIPTIIVKEPSTSSSGKSSQGSSMEIDPQAPE 

PPSQLRPDESLTVSSPFAAAIAGAVRDREKRLEA 

RRNSPAFLSADLGDEHVGLGPPAPRTRPSMFPEE 

GDFADEDSAEQLSSPMPSATPREPENHFVGGAEA 

SAPGEAGRPLNSTSKAQGPESSPAVPSASSGTAG 

PGNYVHPLTGRLLDPSSPLALALSARDRAMKES 

QQGPKGEAPKADLNKPLYIDTKMRPSLDAGFPT 

VTRQNTRGPLRRQETENKYETDLGRDRKGDDK 

KNMLBDIMDTSQQKSAGLLMVHTVDATKLDNA 

LQEEDEKAEVEMKPDSSPSEVPEGVSETEGALQI 

SAAPEPTTVPGRTIVAVGSMEEAVILPFRJPPPPLA 

SVDLDEDFIFTEPLPPPLEFANSFDIPDDRAASVPA 

LSDLVKQKKSDTPQSPSLNSSQPTNSAOSKKPAS 

LSNCLPASFLPPPESFDAVADSGIEEVDSRSSSDH 

HLETreilSTVSSISTLSSEGGENVDTCTVYADGQ 

AFMVDKPPVPPKPKMKPIIHKSNALYQDALVEE 
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$£QI1> 
NO: 


Method 


Predicted 

beginning 

nadeotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last arnino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Atanine C=Cysteinc, D^Aspartic Acid, 
E^Glutamic Acid, F^^'Phenylalanine, G=GIycine, H-Histidine, 
I'lsolcuclne, KpLyslne, DeLencine^ M^Metliionine, 
N=Asparagine, P^Proline, Q=>Glutainine. R-Arginine, S^^rine, 
T=Threonlne, V^Valine, W^^Tryptophan, Y=Tyrosine, 
X»Unknown, *=>Stop codon, /=po5sible nucleotide deletion, 
\=possible nucleotide insertion 










DVDSFVIPPPAPPPPPGSAQPGMAKVLQPRTSKL 

WGDVTEIKSPILSGPKANVISELNSILQQMNREKL 

AKPGEGLDSPMGAKSASLAPRSPEIMSTISGTRST 

TVIKIVRPGTSQPITLQSRPPDYESRTSGTRRAPS 

PVVSPTEMNKETLPAPLSAATASPSPALSDVFSLP 

SQPPSGDLFGLNPAGRSRSPSPSILQQPISNKPFTT 

KPVHLWTKPDVADWLESLNLGEHKEAFMDNEI 

DGSHLPNLQKEDLIDLGVTRVGHRMNIERALKQ 

LLDR 


3871 


A 


35 


1171 


VESRSAWHEGEDQIDRLDFIRNQMNLLTLDVKK 

KIKEVTEEVANKVSCAMTDEICRLSVLVDEFCSE 

FHPNPDVLKIYKSELNKHIEDGMGRNLADRCTD 

EVNALVLQTQQEIIENLKPLLPAGIQDKLHTLIPC 

KKFDLSYNLNYHKLCSDFQEDIVFRFSLGWSSLV 

HRFLGPRNAORVLLGLSEPIFOLPRSLA STPTAPT 

TPATPDNASQEELMITLVTGLASVTSRTSMGIIIV 

GGVIWKTIGWKLLSVSLTMYGALYLYERLSWTT 

HAKERAFKQQFVNYATEKLRMIVSSTSANCSHQ 

VKQQIATTFARLCQQVDITQKQLEEEIARLPKEID 

QLEKIQNNSKLLRNKAVQLENELENFTKQFLPSS 

NEES 


3872 


A 


35 


1171 


VESRSAWHEGEDQIDKLDFIRNQMNLLTLDVKK 

KIKEVTEEVANKVSCAMTDEICRLSVLVDEFCSE 

EHPNPDVLJOYKSELNKHIEDGMGKNLADRCTD 

EVNALVLQTQQEIIENLKPLLPAGIQDKLHTLIPC 

KBCFDLSYNLNYHKLCSDFQEDIVFRFSLGWSSLV 

HRFLGPRNAORVLLGLSEPIFOLPRSLASTPTAPT 

TPATPDNASQEELMITLVTGLASVTSRTSMGIIIV 

GGVIWKTIGWKLLSVSLTMYGALYLYERLSWTT 

HAKERAFKQQFVNYATEKLRMIVSSTSANCSHQ 

VKQQIATTFARLCQQVDITQKQLEEEIARLPKEID 

QLEKIQNNSKLJLRNKAVQLENELENFTKQFLPSS 

NEES 


3873 


A 


2944 


2089 


PVCTALTPGRMTDDKDVLRDVWFGRIPTCFTLY 

QDEITEREAEPYYLLLPRVSYLTLVTDKVKKHFQ 

KVMRQEDISEIWFEYEGTPLKWHYPIGLLFDLLA 

SSSALPWNITVHFKSFPEKDLLHCPSKDAIEAHF 

MSCMKEADALKHKSQVINEMQKKDHKQLWMG 

LQNDRFDQFWAINRKLMEYPAEENGFRYIPFRIY 

QTTTERPFIQKLFRPVAADGQLHTLGDLLKEVCP 

SAmPEDGEKJO^QVMIHGIEPMLETPLQWLSEHL 

SYPDNFLfflSnPQPTD 


3874 


A 


776 


366 


QARGAPSSPMCPLPLAAAAVAAPRAPLRLLNRG 

LAAAMSTAQSLKSVDYEVFGRVQGVCFRMYTE 

DEARKIGWGWVKNTSKGTVTGQVQGPEDKVN 

SMKSWLSKVGSPSSRIDRTNFSNEKTISKLEYSNF 

SIRY 


3875 


A 


1081 


182 


SLSSCQTDPRPMSAPLDAALHALQEEQARLKMR 

LWDLQQLRKELGDSPKDKVPFSVPKIPLVFRGHT 

QQDPEVPKSLVSNLRIHCPLLAGSALITFDDPKVA 

EQVLQQKEHTINMEECRLRVQVQPLELPMVTTIQ 

VMVSSQLSGRRVLVTGFPASLRLSEEELLDKLEIF 

FGKTRNGGGDVDVRELLPGSVMLGFARDGVAQ 

RLCQIGQFTVPLGGQQVPLRVSPYVNGHQKAEI 

RSQPVPRSVLVLNIPDILDGPELHDVLEIHFQKPT 
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SEQlD 
NO: 


Method 

• 


Predicted 
begmniDg 

11 U vV UUs 

location 

corresponding 
to first amino 
acid residue of 
DCDtide 

ft# W MiF *■ ^ 

sequence 


Predicted end 
nucleotide 

corresponding 
to last amino 
acid residue of 
peptide 
seauencc 


Amino acid sequence (A^Alanine C=Cysteiae, D^Aspartic Acid, 
£=Glutamic Acid, F=Pbenylalanine, G=Glycine, H»Histidine, 

M~*iaiiicuviiiCy n, ujr9iiic, Ai^AAUviucy ivimciiiiuniiic, 

N"Asparagine, P«»Prollne, Q^GIutamine, R^Arglnine, SBSerinc, 
T=Threonine, V=Valine, W=Tryptophan, Y«Tyroslne, 
X=Unkno%vn, *=Stop codon, /=possible nucleotide deletion, 
\-pos8ible nucleotide insertion 










RGGGEVEALTWPQGQQGLAVFTSESG 


3876 


A 


26 


431 


RMMKCPQALLAIFWLLLSWVSSEDKVVQSPLSL 
WHEGDTVTLNCSYEVTNFRSLLWYKQEKKAPT 
FLFMLTSSGIEKKSGRLSSILDKKELSSILNITATQ 
TGDSAIYLCAVEAQCSLVTCSLYSNSTAEALQL 


3877 


A 


3 


1291 


KAFRLLAERGAAAAMLWSGCRRFGARLGCLPG 

GLRVLVQTGHRSLTSCIDPSMGLNEEQKEFQKV 

AFDFAAREMAPim\EWDQKELFPVDVMRKAA 

QLGFGGVYIQTDVGGSGLSRLDTSVIFEALATGC 

TSTTAYISIHNMCAWMIDSFGNEEQRHKFCPPLC 

TMEKFASYCLTEPGSGSDAASLLTSAKKQGDHYI 

LNGSKAFISGAGESDIYVVMCRTGGPGPKGISCIV 

VEKGTPGLSFGKKEKKVGWNSQPTRAVIFEDCA 

VPVANRIGSEGQGFLIAVRGLNGGRINIASCSLGA 

AHASVILTRDHLNVRKQFGEPLASNQYLQFTLA 

DMATRLVAARLMVRNAAVALQEERKDAVALCS 

MAFCLFATDECFAICNQALQMHGGYGYLKDYAV 

QQYVRDSRVHQILEGSNEVMRILISRSLLQE 


3878 


A 


10 


1014 


LPGSTISSSGCQAPGRADSSGGARNSRRGDSRPG 

SCNRQAVAPPCPSPGPQSRHWIHRGTAPQAGETR 

TLGRGSSAPNACSASVTPCCPSSPPS*SCL*PTRRS 

PQNSSSrEVYRGFWQHGLPST**PFSS*QWPGQH 

TQGCSKLLGKQTTHLPCSTWPA**PSPSCLTRFR* 

W*PSLMCLWASSCSVCV*SPSGSCRH*LWGTHST 

SRTC*ARRSSALPTGLCTDDTSWASSSKARPCAL 

QRPSSLSSLSPCLTC*W*LSSSSPMSARSPAGAET 

GSWATGSPRLTQWKSSRLTSTSHSARSAWKPSA 

TESTPSWPRFSSWTSGEDPASPAPAI 


3879 


A 


200 


699 


LLLTGYIQTLQNQQLSGNQQEMQAVDNLTSAPG 

NI^LCTRDYKITQVLFPLLYTVLFFVGLITNGLA 

MRIFFQmSKSNFOTLKNTVISDLLMILTFPFKILS 

DAKLGTGPLRTFVCQVTSVIFYFTMYISISFLGLIT 

IDRYQK'l-lRPFKTSNPKNLLGAKILK 


38S0 


A 


26 


169 


QPETDTMVHLTPEEKSAVTALWGKVNVDEDAG 
DDLCQILVDRPRLRI 


3881 


A 


37 


1100 


TPLFDFWPGFVLSWLQPLSASLRARRAASGPPAC 

RIMPTTVDDVLEHGGEFHFFQKQMFFLLALLSAT 

FAPIYVGIVFLGFTPDHRCRSPGVAELSLRCGWSP 

AEELNYTVPGPGPAGEASPRQCRRYEVDWNQST 

FDCVDPLASLDTNRSRLPLGPCRDGWVYETPGSS 

IVTEFNLVCANSWMLDLFQSSVNVGFFIGSMSIG 

YIADRFGRKLCLLTTVLINAAAGVLMAISPTYTW 

MLIFRLIQGLVSKAGWLIGYILITEFVGRRYRRTV 

GIFYQVAYTVGLLVLAGVAYALPHWRWLQFTV 

ALPNFFFLLYYWCBPESPRWLISQNKNAEAMRIIK 

HIAKKNGKSLPASL 


3882 


A 


573 


1620 


KSICCRFPEGLSEGFGPMRKEALSSGSVQEAEAM 

LDEPQEQAEGSLTVYVISEHSSLLPQDMMSYIGP 

KRTAVVRGIMHREAFNnGRRIVQVAQAMSLTED 

VLAAALADHLPEDKWSAEKRRPLKSSLGYEITFS 

LLNPDPKSHDVYWDEEGAVRRYVQPFLKALGAA 

GNFSVDSQILYYAMLGVNPRFDSASSSYYLDMH 

SLPHVINPVESRLGSSAASLYPVLNFLLYVPELAH 

SPLYIQDKDGAPVATNAFHSPRWGGIMVYNVDS 

KTYNASVLPVRVEVDMVRVMEVFLAQLRLLFGI . 
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SEQID 
NO: 


Method 


Predicted 
beginning 
nucleotide 
Incfltion 

corresponding 
to first amino 
acid residue of 
peptide 
sequence 


Predicted end 

nucleotide 

location 

cnrmnnndino 

to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=Aianine C=Cysteine, D=Aspartic Acid, 
E^lutamtc Acid, F^Pbenylalanioe, G^lycine, M^Histidioe, 
I^Isoleucine, K=Lysine, L^Leuctne, M-Methionine, 

Ms Acnaraoinp P=Prnline. CVstf^lutAiniiiA RBAroininfk Cs^AiHn^ 

T^Threonine, V^Valinc, W-Tryptophan, Y^Tyrosine, 
X-UnknowD) *=Stop codon, /=possib]e nucleotide ddetion, 
^ppossible nucleotide insertion 










AQPQLPPKCLLSGPTSEGLMTWELDRLLWARSV 
ENLATArriLTSLA 


3883 


A 


2369 


844 


RIHREEDFQFILKGIARLLSNPLLQTYLPNSTKKIQ 

FHQELLVLFWKLCDFNKVGQPRGALQGDGEQLP 

Q*PGGRDSVRLRGVGQSCPSLELSPLGPSPHP*KF 

LFFVLKSSDVLDILVPILFFLNDARADQSRVGLM 

HIGVFILLLLSGECNFGVRLNKPYSIRVPMDIPVF 

TGTHADLLlVWFHKIITSGHQRLQPLFDCLLTrW 

NVSPYLKSLSMVTANKLLHLLEAFSTTWFLFSAA 

QNHHLVFFLLEVFNNIIQYQFDGNSNLVYAIIRKR 

QGGAPPWRAPAPLPLQSQAPSRPVWWLLQALTS 

*PRSPRCQRMAPCGPWNLSPSRAWRMAARLRGS 

PARHGGSSGDRP/HSSASGQWSPTPEWVLSWKS 

KLPLQTIMRLLQVLVPQVEKICIDKGLTDESEILR 

FLQHGTLVGLLPVPHPILIRKYQANSGTAMWFRT 

YMWGVIYLRNVDPPVWYDTDVKLFEIQRV 


3884 


A 


1 


804 


NGPRAPFSQEGQSTGPPPLIPRLGQHGAQGRIPPL 

GPEHGPERGPLRGGQDCRGPPDRRGPHPDFPDDF 

SRPDDFHPDKRFGHRLREFEGRGGPLPQEEKWR 

RGGPGPPFPPDHREFSEGDGRGAARGPPGAWEG 

RRPGG*TFPPGSRGPTFS/SGAEEESFRRGAPPRHE 

GRAPPRGRDGFPGPEDFGPEENFDASEEAARGRD 

LRGRGRGTPRGERVTBCDTWSGRIGCRIHWL 


3885 


A 


3 


996 


GRRRAGPAHSARMYNMMETELKPPGPQQTSGG 
GGGNSTAAAAGGNQKNSPDRVKRPMNAFMVW 
SRGQRRKMAQENPKMHNSEISKRLGAEWKLLSE 
TEKRPFroEAKRLRALHMKEHPDYKYRPRRKTK 

GAGVNQRMDSYAHMNGWSNGSYSMMQDQLG 

YPQHPGLNAHGAAQMQPMHRYDVSALQYNSM 

TSSQTYMNG/SRPTYSMSYSQQGTPGMAPGS\MG 

SVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMIS 

MYLPGAEVPEPAAPSRLHMSOHYOSGPVPGTAI 

NGTLPLSHM 


3886 


A 


773 


317 


QCTQKAAEGYTQFYYVDVLDGKLACVNKCTKG 
TKSQMNCNLGTCQLQRSGPRCLCPNTNTHWYW 
GETCEFNIAKSLVYGIVGAVMAVLLLALIILIILFS 
LSO\RKRHRPESEGEADFGLENATNNFG\PTLETV 
DSGTELHIQ\RPEMVASTV 


3887 


A 


3 


466 


VDFRVKTLLVDNKCFVLQLWDTAGQERYHSMT 

ROLLRKADGVVLMYDITSOESFAHVRYWLDCL 

QDAGSDGVVILLLGNKMDCEEERQVSVEAGQQL 

AQELGVYFGECSAALGHlNn[LEPVVNLARSLRMQ 

EEGLKDSLVKVAPKRPPKRFGCCS 


3888 


A 


3412 


3144 


QNTDITNFSSSWNDGLAFCALLHTYLFAHIPYQEL 

NSQDKRRNFMLAFQAAESVGIKSTLDINEMVRT 

ERPDWQNVMLYVTAIYKYFET 


3889 


A 


1 


1160 


LVVTAITAILAFPNEYTRMSTSELISELFNDCGLL 

DSSKLCDYENRFNTSKGGELPDRPAGVGVYSAM 

WQLALTLILKIVITIFTFGMKIPSGLFIPSMAVGAI 

AGRLLGVGMEQLAYYHQEWTVFNSWCSQGAD 

CITPGLYAMVGAAACLGGVTRMTVSLWIMFEL 

TGGLEYIVPLMAAAMTSKWVADALGREGIYDA 
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SEQID 
NO: 


Method 

• 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
Ducieotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'^'Alanine C=Cysteine, D=Aspartic Acid, 
JE>Glotaniic Acid, F^PbenylaJanine, G»Giycioe. H^Histidine, 
lalsoleudne* K^Lysine, Ir=l<eudBe, M=Metliionine, 
N^Asparaglne, P=Pr€linc, Q=K21ntanilne» R=^rglnine, S^erine, 
T»Tbreoninc V=Val|ne, W=Tryptophan, Y=Tyroslne, 
X=sUnknown, *sStop codon, ^possible nucleotide deletion, 
^possible nucleotide insertion 










HmLNGWFLEABOBEFAHKTLAMDVMKPRRNDP 

LLTVTTQDSMTVEDVETnSETTYSGFPVVVSRES 

QRLVGFVLRRDLnSIENARKKQDGVVSTSIIYFTE 

HSPPLPPYTPPTLKLRNILDLSPFrVTDLTPMEIVV 

DFRKLGLRQCLVTHNGRLLGIirKKDVLKHIAQ 

MANQDPDSILFN 


3890 


A 


1 


387 


SWCWTGIFVLGTTNLRLEGSWYRSLWGPGFNTT 
TATLGFGAPQAPVGDVALNQPDMCVYRRGRXK 
RVPYTKLQLKELENE YAINKFINKDKRREJ S AAT 
NLSERQVTIWFQNRRVKDKKIVSKLKDTVS 


3891 


A 


2 


2914 


RGGGGDHKMADLSLLQEDLQEDADGFGVDDYS 

SESDVIIIPSALDLAST/QDEMVERPLGRL\DK\YA 

ASENHI*PDKMVAPEFASIPLRE\VCDDERDCIAV 

LGKN*PDWADDSEPT\VRAAELEQVPHIALFLFK 

KTRLSITICFFSKFLLPYCGLDTLADQN\NQVRKT 

SQAALLVALLEQELIERFDVETKVCPVLIELTAPDS 

M3DVKTEAVAIMCKMAP\MVGKDITERLILPRFC 

EMCCDCRMFHNVRKWCAANFGDICSWGQQAT 

EEMLLPRFFQLCSDlfVWGVRKACAECFMAVSC 

ATCQEIRRTKLSALFINLISDPSRWVRQAAFQSLG 

PFISTFANPSSSGQYFKEESKSSEEMSVE>}NKRTR 

DQEAPEDVQVRPEDTPSDLSVSNSSVBLENTMED 

HAAEASGKPLGEISVPLDSSLLCTLSSESHQEAAS 

NENDKKPGNYKSMLRPEVGTTSQDSALLDQELY 

NSFHFWRTPLPEIDLDIELEQNSGGKPSPEGPEEE 

SEGPVPSSPNITMATRKELEEMIENLEPHIDDPDV 

KAQVEVLSAALRASSLDAHEETISIEKRSDLQDE 

LDINELPNCKINQEDSVPLISDAVENMDSTLHYIH 

NDSDLSNNSSFSPDEERRTKVQDWPQALLDQY 

LSMTDPSRAQTVDTEIAKHCAYSLPGVALTLGR 

QNWHCLRETYETLASDMQWKVRRTLAFSIHELA 

VILGD\QLTAADLVPIFNGFLK*PSMKSRIGVLKH 

LHDFLKLLHIDKRREYLYOLOEFLVTDNSRNWR 

FRAELAEQLILLLELYSPRDVYDYLRPIALNLCAD 

KVSSVRWISYKLVSEMVKKLHAATPPTFGVDLIN 

ELVENFGRCPKWSGRQAFVFVCQTVIEDDCLPM 

DQFAVHLMPHLLTLANDRVPNVRVLLAKTLRQT 

LLEKDYFLASASCHQEAVEQTIMALQMDRDSDV 

KYFASIHPASTKISEDAMSTASSTY 


3892 


A 


158 


2191 


VPLPAPSGLSGGGSRGAGCKKAPPGRAPAPGLAP 

LRPSEPTMAVPPGHGPFSGFPGPQEHTQVLPDVR 

LLPRRLPLAFRDATSAPLRKLSVDLIICTYKHINEV 

YYAKKKRRAQQAPPQDSSNKKEKKVLNHGYDD 

DNHDYIVRSGERWLERYEIDSLIGKGSFGQVVKA 

YDHQTQEL VA IKIIKNKKAFLNQ AQIELRLLELM 

KQHDTEMKYVnVHLKRHFMFRN\HLCLVFELLS 

YNLYDLLRNTHFRGVSLNLTRKLAQQLCTALLF 

LATPELSUHCDLKPENILLCNPKRSAHaVDFGSS 

CQLGQRIYQYIQSRPYRSPEVLLGTTYDLAIDMW 

SLGCILVEMHTGEPLFSGSNEVCPQEGVDQMNRI 

VEVLGIPPAAMLDQAPKARKYFERLPGGGWTLR 

RTKELRKDYQGPGTRRLQEVLGVQTGGPGGRRA 

GEPGHSPAD\Y\LRFQDLVLRMLEYEPAARISPLG 

ALQHGFFRRTADEATNTGPAGSSASTSPAPLDTC 

PSSSTASSISSSGGSSGSSSDNRTYRYSNRYCGGP 
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SEQID 
NO: 


Method 

• 


Predicted 
beginning 
Ducleotide 

lOCfltlOD 

corresponding 
to first amino 
acid residue of 
Dcntide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
scouence 


Amino acid sequence (A-Alanine CXysteine, D^Aspartic Acid, 
£=<ilutamic Acid, F»Pnenylalanine, G^lyclne, H^Histidine, 
iBlsoleucine, K=Lysinc» L^Leucine, M»Metliionine, 
N=Asparagine, P==Proline, Q^GIntamlne, R^At^inine, SaSerine, 
T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine, 
X^Unkno^rn, *=Stop codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion 










GPPITDCEMNSPQVPPSQPLRPWAGGDVPHKTH 

QAPASASSLPGTGAQLPPQPRYLGRPPSPTSPPPP 

ELMDVSLVGGPADCSPPHPAPAPQHPAASALRT 

RMTGGRPPLPPPDDPATLGPHLGLRGVPQSTAAS 

S 


3893 


A 


68 


258 


PEEYYPFSPTLQQLFFFLLDSDMGSRPESMGCRK 
NTVPRPASPTEAGTDPQTFLHTWVSECRD 


3894 


A 


1120 


136 


SLPLAPAPAVAGPVALCPAGLCPAQPGMPAGPA 

AASGSHPEVGSVLQRSSQPHWPNPWPGAGHLPP 

PAGPFPYNPPAGPGAAAGLA*SPPRSSPTPCSVGP 

QSCPANASAPPAQPCLAGAPPAASLPPPGPGSVS 

AAPAPGGPAPAEPPLGVPPVPAWLLPDSPPLPGT 

HSGPPPAAVSLPPAAAACPVVVPPPLPHHPPDLES 

PSAAAPNPGCAGGIRHFPPGSPEASSPLRPAAAPA 

LLPLPRPPS*PA^PWKPLHSPVAVAGGSFVAGGSV 

LPAPDLDQPRPSGPPAASPTPGPGVAQPPPGSAVL 

PTVP*APPVSGAAPGRKREW 


3895 
* 


A 


2 


1347 


FGAVSYRPGNGSCWVKVTASSDLSDLISCLCPPR 

SLCSSQACVLPVPGPSLLLPQGLHVGCASAGTRW 

PLSCSroPQRLLAHEEETQKRRAKESGMAFTQLT 

FRDVAIEFSQDEWKCLNSTQRTLYRDVMLENYR 

NLVSLDLSRNCVIKELAPQQEGNP/ARSIPHSDIGT 

T*KT*H*RVLLQGNQEKNTRL*LSVER**KKLQQ 

SDYGPKRKSYL*ERPTR*KRYRKQVY*TSA\*LSF 

LPHPHELQQFQAEGKIYECNHVEKSVNHGSSVSP 

PQnSSTIKTHVSNKYGTDFICSSLLTQEQKSCIRE 

KPYRYIECDKALNHGSHMTVRQVSHSGEKGYKC 

DLCGKVFSQKSNLARHWRVHTGEKPYKCNECD 

RSFSRNSCLALHRRVHTGEKPYKCYECDKVFSR 

NSCLALHQKTfflGEKPYTCKECGQAFSVRSTLTN 

HQVIHSDK 


3896 


A 


202 


498 


MVQSCSAYGCBCNRYDKDKPVSFHKFPLTRPSLC 
KEWEAAVRRKNFKPTKYSSICSEHFTPDCFKREC 
NNKLLKENAVPTIFLCTEPHDKKEDLLEPQEQ 


3897 


A 


2 


382 


SHGLSRAPHLSAAPAPALASRPCFSSAPCSQGGG 
GGGPATMIHFILLFSRQGKLRLQKWYITLPDKER 
KKITREIVQIILSRGHRTSSFVDWKELKLVYKRYA 
SLYFCCAIEVNQDNELLTLENVHR 


3898 


A 


718 


305 


SEQEPLLGDTPGSREWDILETEEHYKSRWRSIRIL 
YLTMFLSSVGFSWMMSIWPYLQKIDPTADTSFL 
GWVIASYSLGQMVASPIFGLWSNYRPRKEPLIVSI 
LISVAANCLYAYLHPASHNKYYMLVARGLLGIG 


3899 


A 


24 


718 


FRGRPGIPEREGKGNHSFVEVARVIVVDLHSRLG 

GAMAERKGTAKVDFLKKIEKEIQQXWDTERVFE 

VNASNLEKQTSKGKYFVTFPYPYMNGRLHLGHT 

FSLSKCEFAVGYQRLKGKCCLFPFGLHCTGMPIK 

ACADKLKREIELY/GCPPDFPDEEEEEEETSVKTE 

DIIIKDKAKGECKSKAA/AKAGSSKYQWGIMKSLG 

LSDEEIVKFSEAEHWLDYFNALAIQDLKRMG 


3900 


A 


360 


1 


VPATSSNVSPSSSESSEPDLSSRSSSSDAPSSSPSVP 
SPCSLSLSSPESPLLPTLLSSKSPAGSAGPTCGCPS 
GPGLRATA/PSRLSSSIAAH/SSSAPETSRPAAARE 
RSPPLHDRESHE 


3901 


A 


193 


345 


GEWAVPPAPGGQGVSIPHGPEPGQGSGVHIAPRQ 
GEGSDRTEPLICPKAAP 
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SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
correspondii^ 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'^AIanine C»Cysteine, D=^A$partic Acid, 
E'^Glutamic Acid, F=PhenylaIanine, G'^Glycine, H^^Histidioe, 
lalsoleuclne, K=Lysine, L=LcuciDe, M^Methionine, 
N^Asparaglne, P»ProIine» Q^lutaminet R^Arginlne, S^^crine, 
T=Thrconine, V=Valinc, W=Tryptophan, Y=Tyrosinc, 
X-Unknown, *=Stop codon, /=possible nucleotide deletion, 
\^pos5ibie nucleotide insertion 


3902 


A 


1188 


1389 


NPAARSAAAREGSPALPPPPVS/SSSGLGLLLPLSP 
PGSHAANPALSPRAPHSHYRPRPRCGPRRRPR 


3903 


A 


63 


396 


NNMRNPHLSSNHYLNLARTETVFARMESVKQRJ 
LAPGKEGLKNFAGKSLGQIYRVXEKKQDTGETIE 
LTEDGKPL* VPERKAPLCDCTCFGLPRRYIIAIMS 
GLGFCISFG 


3904 


A 


732 


1046 


AMSECPLILYEHKHIDTYSQSYLFNDLFYPVYSGG 
RMVTYEHLREVVFGKSEDEHYPLW*VLFGK*YA 
VAPNALMFIRFM*NCTFVPKLP*VMDLK**LQYK 
SR 


3905 


A 


46 


910 

• 


QPPPPPPPPPSPPPPPFPPARALSHLRLHPDACLFPS 

PFPLPCSTMPGMMEKGPELLGKNRSANGSAKSP 

AGGGGSGASSTNGGLHYSEPESGCSSDDEHDVG 

MRVGAEYQARIPEFDPGATKYTDKDNGGMLVW 

SPYHSIPDAKLDEYIAIAKEKHGYNVEQALGMLF 

WHKHNIEKSLADLPNFTPFPDEWTVEDKVLFEQ 

AFSFHGKSFHRIQQMLPDKTIASLVKYYYSWKK 

TRSRTSLMDRQARKLANRHNQGDSDDDVEETHP 

MDGNDSDYDPKKEAKKEGMS 


3906 


A 


2 


513 


KVCNCCSQELETSFTYVDKNINLEQRNRSSPSAK 
GHNHPGELGWENPNEWSQEAAISLISEEEDDTSS 
EATSSGKSIDYGFISAILFLVTGILLVIISYIVPREV 
TVDPNTVAAREMERLEKESARLGAHLDRCVIAG 
LCLLTLGGVILSCLLMMSMWKGELYRRNRFAS 


3907 


A 


71 


412 


ILIMSNCLQNFLKITSTRLLCSRLCQQLRSKRKFF 
GTVPISRLHRRVVITGIGLVTPLGVGTHLVWDRLr 
GGESGIVSLVGEEYKSIPCSVAAYVPRGSDEGQF 
NEQNFVSKSD 


3908 


A 


77 


746 


LGTLLGWRAPLFSRCLAFHSPFILLNTPKLVKTAE 

LPPDRNYVLGAHPHGIMCTGFLCNFSTESNGFSQ 

LFPGLRPWLAVLAGLFYLPVYRDYIMSFGLCPVS 

RQSLDFILSQPQLGQAVVIMVGGAHEALYSVPGE 

HCLTLQKRKGFVRLALRHGASLVPVYSFGENDIF 

RLKAFATGSWQHWCQLTFKKLMGFSPCIFWGR 

GLFSATSWGLLPFAVPITTV 


3909 


A 


1 


793 


FRAAGRPAAAMGDIPVVGLSSWKASPGKVTEAV 

KEAIDAGYRHFDCAYFYHNEREVGAGIRCKIKE 

GAVRREDLLIATKLWCTCHKKSLVETACRKSLK 

ALKLNYLDLYLMWPMGFKPPHPEWIMSCSELSF 

CLSHPRVQDLPLDESNMVIPSDTDFLDTWEAME 

DLVITGLVKNIGVSNFNHEQLERLLNKPGLRFKP 

LTNQIECHPYLTQKNLISFCQSRDVSVTAYRPLG 

GSCEGVDLIDNPVIKRIAKEHGKSPAQILI 


3910 


A 


202 


705 


FFTMHRKKVDNRJRILffiNGVAERQRSLFVVVGD 

RGKDQVVILHHMLSKATVKARPSVLWCYKKEL 

GFSSHRKKRMRQLQKKIKNGTLNIKQDDPFELFI 

AATNIRYCYYl^THKILGNTFGMCVLQDFEALTP 

NLLARTVETVEGGGLVVILLRTMNSLKQLYTVT 

M 


3911 


A 


3 


723 


AGRGARAAGEGGGPFKSRPRPLPSSRSLPAVGGG 

RYGADKMAAGGAVAAAPECRLLPYALHKWSSF 

SSTYLPENILVDKPNDQSSRWSSESNYPPQYLILK 

LERPAIVQNITFGKYEKTHVCNLKKFKVFGGMN 

EENMTELLSSGLKITOYNKETFTLKHKIDEQMFPC 

RFIKIVPLLSWGPSFNFSIWYVELSGIDDPDJVQPC 
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SEQID 


Method 

■ 


Predicted 

WAninvtlnAT 

uegmiiing 

nucleotide 

location 
corresponding 
to first amiDO 
acid residue of 
peptide 
sequence 


Predicted end 
nucicouoe 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A-Alanine C=Cysteine, ]>^Aspartic Acid, 
Ei^uiutaroic Acid, f^rnenyiaianiney u^^lycinCf H^Htstidine, 
l^lsoleucine, K>=Lysine, L^Lcucinc, M=Mcthionine, 
N==Asparagine, P=Proline, Q=Glutamine, R'^Arginine) S=Scrine, 
T=^Threonine, V^Vallnc, W=Tryptophan, Y=Tyrosine, 
X=Unknown^ *=Stop codon, /=>po5Sible nucleotide deletion, 
^possible nucleotide insertion 










LNWSKYREQEAIRLCLKHFRQHNYTBAFESLQ 
KKT 


3912 


A 


2 


461 


FEKKQLRRPSLFLLGCCSFGIMAPSLWKGLEGIG 

LFALAHAAFSAAQHRSYMRLTEKEDESLPIDIVL 

QTLLAFAVTCYGIVHIAGEFKDMDATSELKNKTF 

DTVRNHPSFYVFNHRGSEYFSGPSDTANSSNQDA 

LSSNTSLKLRKLESLRR 


3913 


A 


362 


20 


APGRPEAKVPERSRESGSRRVRGPLLQLRPGRTS 
RPASGRGRGGAGGSYGBCMRKPDSKIVLLGDMN 
VGKTSLLQRYMERRFPDTVSTVGGAFYLKQWRS 
YNISIWDTAGEAGAA 


3914 




A 


1 


7545 

4 


PGIRVGITSQTGLSSNLQENCSKLAFISSHGTEKQ 

LQCMPMEGRGRASSSISDLQGKGFEKGTGEKHV 

PGVGSARHSPQASAGGSPWQRGKAQTRWLGKP 

DPGRKRRRGSPQEEGGLRVSAAARLLCSGANRC 

KVLVRQNSTPNTQQPAVHPSTPPSRPLPQAGRCL 

VAPLRPHPDWVAAKTLAKALRAPGKPWRLAAP 

SPLGDLGAPGLPGPSTAPRTLSVEEPGVECNQLC 

LYADVTDPVLCLGQKDPGVEGKHCEKEKISSSK 

ELKHVHAKSEPSKPARRLSESLHWDENKNESKI 

EREHKRRTSTPVIMEGVQEETDTRDVKRQVERSE 

ICTEEPQKQKSTLKNEKHLKKDDSETPHLKSLLK 

KEVKSSKEKPEREKTPSEDKLSVKHKYKGDCMH 

KTGDETELHSSEKGLKVEENIQKQSQQTKLSSDD 

KTERKSKHRNERKLSVLGKDGKPVSEYIIKTDEN 

VRKENNKKERKLSAEKTKAEHKSRRSSDSKIQK 

DSLGSKQHGITLQRRSESYSEDKCDMDSTNMDS 

NLKPEEVVHKEKRRTKSLLEEKLVLKSKSKTQG 

KQVKVVETELQEGATKQATTPKPDKEKNTEEND 

SEKQRKSKVEDKPFEETGVEPVLETASSSAHSTQ 

KDSSHRAKLPLAKEKYKSDKDSTSTRLERKLSD 

GHKSRSLKHSSKDKKKDENKSDDKDGKEVDSS 

HEKARGNSSLMEKKLSRRLCENRRGSLSQEMAK 

GEEKLAANTLSTPSGSSLQRPKKSGDMTLIPEQEP 

MEIDSEPGVENVFEVSKTQDNKNNNSHQDIDSEN 

MKQKTSATVQKDELRTCTADSKATAPAYKPGR 

GTGWSNSEKHADHRSTLTKKMHIQSAVSKMNP 

GEKEPIHRGTTEVNIDSETVHRMLLSAPSENDRV 

QKNLKNTAAEEPJVAQGDATLEHSTNLDSSPSLSS 

VTVVPLRESYDPDVIPLFDKRTVLEGSTASTSPAD 

HSALPNQSLTVRESEVLKTSDSKEGGEGFTVDTP 

AKASITSKRHIPEAHQATLLDGKQGKVIMPLGSK 

LTGVIVENENTTKEGGLVDMAKKENDLNAEPNL 

KQTIKATVENGKKDGIAVDHVVGLNTEKYAETV 

KLKHKRSPGKVKDISIDVERRNENSEVDTSAGSG 

SAPSVLHQRNGQTEDVATGPRRAEKTSVATSTE 

GKDKDVTLSPVKAGPATTTSSETRQSEVALPCTS 

lEADEGLIIGTHSRNNPLHVGAEASECTVFAAAEE 

GGAVVTEGFAESETFLTSTKEGESGECAVAESED 

R^ADLLAVHAVKIEANVNSVVTEEKDDAVTSAG 

SEEKCDGSLSRDSEIVEGTITFISEVESDGAVTSAG 

TEIRAGSISSEEVDGSQGNMMRMGPKKETEGTV ' 

TCTGAEGRSDNFVrCSVTGAGPREERMVTGAGV 

VLGDNDAPPGTSASQEGDGSVNDGTEGESAVTS 

TGITEDGEGPASCTGSEDSSEGFAISSESEENGESA 
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SEQlD 
NO; 


Method 

« 


Predicted 

begioaing 

nucleotide 

location 

corresponding 

to first ami DO 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A«Aianine C=Cysteine, D^Aspartlc Acid, 
E^lutamic Acid, F»Phenylalanine, G^GIycine, H^Histidtoe, 
I^Isoleucine, K^Lysine, L^Leudne, M^^ethionine, 
N«Asparaglne, P^Prollne, Q«=GlutBniine, R=Arginine, S=sSerine, 
T=Threonine, V=Vallne, W^Tryptoplian, Y=Tyrosiiie, 
X^Unknown, *='Stop codon, /^possible nucleotide deletion, 
\Bpossible nucleotide insertion 










JSmSTVAKEGTNVPLVAAGPCDDEGIVTSTGAKE 

EDEEGEDWTSTGRGNEIGHASTCTGLGEESEGV 

LICESAEGDSQIGTVVEHVEAEAGAAIMNANENN 

VDSMSGTEKGSKDTDICSSAKGIVESSVTSAVSG 

KDEVTPVPGGCEGPMTSAASDQSDSQLEKVEDT 

TISTGLVGGSYDVLVSGEVPECEVAHTSPSEKED 

EDnTSVENEECDGLMATTASGDITNQNSLAGGK 

NQGKVLnSTSTTNDYTPQVSAITDVEGGLSDALR 

TEENMEGTRVTTEEFEAPMPSAVSGDDSQLTASR 

SEEKDECAMISTSIGEEFELPISSATTIKCAESLQP 

VAAAVEERATGPVLISTADFEGPMPSAPPEAESP 

LASTSKEEKDECALISTSIAEECEASVSGWVESE 

NERAGTVMEEKDGSGIISTSSVEDCEGPVSSAVP 

QEEGDPSVTPAEEMGDTAMISTSTSEGCEAVMIG 

AVLQDEDRLTITRVEDLSDAAIISTSTAECMPISA 

SIDRHEENQLTADNPEGNGDLSATEVSKHKVPM 

PSLIAENNCRCPGPVRGGKEPGPVLAVSTEEGHN 

GPSVHKPSAGQGHPSAVCAEKEEKHGKECPEIGP 

FAGRGQKESTLHLINAEEKNVLLNSLQKEDKSPE 

TGTAGGSSTASYSAGRGLEGNANSPAHLRGPEQ 

TSGQTAKDSS VSSIRYLAA VNTGAIKADDMPPVQ . 

GTVAEHSFLPAEQQGSEDNLKTSTTKCITGQESKI 

APSHTMIPPATYSVALLAPKCEQDLTIKNDYSGK 

WTDQASAEKTGDDNSTRKSFPEEGDIMVTVSSE 

ENVCDIGNEESPLNVLGGLKLKANLKMEAYVPS 

EEEKKGEILAPPESLCGGKPSGIAELQREPLLVNE 

SLNVENSGFRTNEEIHSESYNKGEISSGRKDNAE 

AISGHSVEADPKEVEEEERHMPKRKRKQHYLSSE 

DEPDDNPDVLDSRIETAQRQCPETEPHATKEENS 

RDLEELPKTSSETNSTTSRVMEEKDEYSSSETTGE 

KPEQNDDDTIKSQE 


3915 


A 


1 


7545 


PGIRVGITSQTGLSSNLQENCSKLAFISSHGTEKQ 

LQCMPMEGRGRASSSISDLQGKGFEKGTGEKHV 

PGVGSARHSPQASAGGSPWQRGKAQTRWLGKP 

DPGRKRRRGSPQEEGGLRVSAAARLLCSGANRC 

KVLVRQNSTPNTQQPAVHPSTPPSRPLPQAGRCL 

VAPLRPHPDWVAAKTLAKALRAPGKPWRLAAP 

SPLGDLGAPGLPGPSTAPRTLSVEEPGVECNQLC 

LYADVTDPVLCLGQKDPGVEGKHCEKEKISSSK 

ELKHVHAKSEPSKPARRLSESLHVVDENKNESKI 

EREHKRJRTSTPVIMEGVQEETDTRDVKRQVERSE 

ICTEEPQKQKSTLKNEKHLKXDDSETPHLKSLLK 

KEVKSSKEKPEREKTPSEDKLSVKHKYKGDCMH 

KTGDETELHSSEKGLKVEENIQKQSQQTKLSSDD 

KTERKSKHRNERKLSVLGKDGKPVSEYIIKTDEN 

VRKENNKKERRLSAEKTKAEHKSRRSSDSKIQK 

DSLGSKQHGITLQRRSESYSEDKCDMDSTNMDS 

NLKPEEVVHKEKRRTKSLLEEKLVLKSKSKTQG 

KQVKVVETELQEGATKQATTPKPDKEKNTEEND 

SEKQRKSKVEDKPFEETGVEPVLETASSSAHSTQ 

lOSSHRAKLPLAKEKYKSDKDSTSTRLERKLSD 

GHKSRSLKHSSKDIKKKDENKSDDKDGKEVDSS 

HEKARGNSSLMEKKLSRRLCENRRGSLSQEMAK 

GEEKLAANTLSTPSGSSLQRPKKSGDMTLIPEQEP 

MEroSEPGVENVFEVSKTQDNRNNNSHQDroSEN 



466 



wo 01/57190 



PCTAJSOl/04098 



SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A^AIaninc C=Cysteine, D=Aspflrtic Acid, 
E^Glutamic Acid, F=Phcnylfllanine, C«=Glycinc, H^Histidine, 
I~Isoleuciae, K-Lysine, Lr=Leucine, M~IVfethionine, 
N»Asparagine, F-Proline, Q^GIutamine, BpArginine, S=Senne, 
Tsn'hreonine, VaValine, W»Tiyptoplian, Y-Oyrosine, 
X=Unknown, *=Stop codon, ^''possible nucleotide deletion, 
^possible nucleotide insertion 




• 






MKQKTSATVQKDELRTCTADSKATAPAYKPGR 

GTGVNSNSEKHADHRSTLTKKMmqSAVSKMNP 

GEKEPmOTTEVNIDSETVHRMLLSAPSENDRV 

QKNLKNTAAEEHVAQGDATLEHSTNLDSSPSLSS 

VTVVPLRESYDPDVIPLFDBCRTVLEGSTASTSPAD 

HSALPNQSLTVRESEVLKTSDSKEGGEGFTVDTP 

AKASITSKRHIPEAHQATLLDGKQGKVIMPLGSK 

LTGVIVENENITKEGGLVDMAKKENDLNAEPNL 

KQTIKATVENGKKDGIAVDHVVGLNTEKYAETV 

KLKHKRSPGKVKDISIDVERRNENSEVDTSAGSG 

SAPSVLHQKNGQTEDVATGPRRAEKTSVATSTE 

GKDKDVTLSPVKAGPA in SSETRQSEVALPCTS 

lEADBGLnGTHSRNNPLHVGAEASECTVFAAAEE 

GGAWTEGFAESETFLTSTKEGESGECAVAESED 

RAADLLAVHAVKIEAKVNSVVTEEKDDAVTSAG 

SEEKCDGSLSRDSEIVEGTITFISEVESDGAVTSAG 

TEIRAGSISSEEVDGSQGNMMRMGPKKETEGTV 

TCTGAEGRSDNFVICSVTGAGPREERMVTGAGV 

VLGDNDAPPGTSASQEGDGSVNDGTEGESAVTS 

TGITEDGEGPASCTGSEDSSEGFAISSESEENGESA 

MDSTVAKEGTNVPLVAAGPCDDEGIVTSTGAKE 

EDEEGEDWTSTGRGNEIGHASTCTGLGEESEGV 

LICESAEGDSQIGTVVEHVEAEAGAAIMNANENN 

VDSMSGTEKGSKDTDICSSAKGIVESSVTSAVSG 

KDEVTPVPGGCEGPMTSAASDQSDSQLEKVEDT 

TISTGLVGGSYDVLVSGEVPECEVAHTSPSEKED 

EDIITSVENEECDGLMATTASGDITNQNSLAGGK 

NQGKVLnSTSTTNDYTPQVSAITDVEGGLSDALR 

TEENMEGTRVTTEEFEAPMPSAVSGDDSQLTASR 

SEEKDECAMISTSIGEEFELPISSATTIKCAESLQP 

VAAAVEERATGPVLISTADFEGPMPSAPPEAESP 

LASTSKEEKDECALISTSIAEECEASVSGVVVESE 

NERAGTVMEEKDGSGIISTSSVEDCEGPVSSAVP 

QEEGDPSVTPAEEMGDTAMISTSTSEGCEAVMIG 

AVLQDEDRLTITRVEDLSDAAIISTSTAECMPISA 

SIDRHEENQLTADNPEGNGDLSATEVSKHKVPM 

PSLIAENNCRCPGPVRGGKEPGPVLAVSTEEGHN 

GPSVHKPSAGQGHPSAVCAEKEEKHGKECPEIGP 

FAGRGQKESTLHLINAEEKNVLLNSLQKEDKSPE 

TGTAGGSSTASYSAGRGLEGNANSPAHLRGPEQ 

TSGQTAKDSSVSSIRYLAAVNTGAIKADDMPPVQ 

GTVAEHSFLPAEQQGSEDNLKTSTTKCITGQESKl 

APSHTMIPPATYSVALLAPKCEQDLTIKNDYSGK 

WTDQASAEKTGDDNSTRKSFPEEGDIMVTVSSE 

EEEKNGEILAPPESLCGGKPSGIAELQREPLLVME 

SLNVENSGFRTNEEIHSESYNKGEISSGRKDNAE 

AISGHSVEADPKEVEEEERHMPBCRKRKQHYLSSE 

DEPDDNPDVLDSRIETAOROCPETEPHATKEENS 

RDLEELPKTSSETNSTTSRVMEEKDEYSSSETTGE 

KPEQNDDDTIKSQE 


3916 


A 


2 


773 


GPFGA^WPSAKPGPVTAVEARPPDASDPEGLRG 
GSPAPLLAPGPLDPSGRLHPAVSMMSYLKQPPYG 
MNGLGLAGPAMDLLHPSVGYPATPRKQRRERTT 
FTRSQLDVLEALFAKTRYPDIFMREEVALKINLPE 



467 



wo 01/57190 



PCT/USOl/04098 



S£QU> 
NO; 


Method 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 

ft aA -ft 

nucleotide 

location 

corresponding 

to last amino 

acid residue of 

peptide 

sequence 


Amino acid sequence (A'='Alanine C^Cysteine, I>=:Aspartic Acid, 
E^Glutamic Acid, F>'Phenylalanine, G^lycine, H°Histidine« 
lalsolcud ne, K»Lysine, L«Leucioe, M=Methiontne, 
NaAsparagine, F^Proline, Q=GlutamiDe, R=»Ai^inine, S^crine, 
T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine, 
X^Unlcnown, ^»Stop codon, /^possible nucleotide deletion, 
\F=possible nucleotide insertion 










SRVQVWFKNRRAKCRQQQQSGSGTKSRPAKKK 
SSPVRESSGSESSGQFTPPAVSSSASSSSSASSSSA 
NPAAAAAAGLWAKLPCPLHIFSLCVFIEENRLV 
SGSWARDIRSVEETDKSGYR 


3917 


A 


2 


776 


RNPGRRFRPPGLRRLLKGPHMPREPRGYRTRVP 

ALRELVPSSHAGSGASEHCQNNRQGSRQHRASR 

NVQAGGALAPPRHLCGLCSRLHFLKPDLSVRAA 

PSRAGASVMALRKELLKSIWYAFTALDVEKSGK 

VSKSQLRVLSHNLYTVLHIPHDPVALEEHFRDDD 

DGPVSSQGYMPYLNKYILDKVEEGAFVKEHFDE 

LCWTLTAKKNYRADSNGNSMLSNQDAFRLWCL 

FNFLSEDKYPLIMDPDEGEYLLKRYS 


3918 


A 


10 


318 


WQDLVCLGGSRAQEQKPLQQLWNAILLVAMLL 
CTGLWQAQRQASRQSQRELGGQVDLFKRRVV 
RRLASLKTRRCRLSRAAQGLPDPGAETCAVCLD 
YFCNKQ 


3919 


A 


1 


204 


RVLTAINHTLKENLRKFYKGKKJDKPLDLRPKKT 

RAMRRRLNMHEENLKIKKQHRKERLYPLRKYA 

AKA 


3920 


A 


1 


654 


RCCRSFVAPLQEKWFGLFFLGAILCLSFSWLFHT 

VYCHSEGVSRLFSKLDYSGIALLIMGSFVPWLYY 

SFYCNPQPCFIYLIVICVLGIAAIIVSQWDMFATPQ 

YRGVRAGVFLGLGLSGIIPTLHYVISEGFLKAATI 

GQIGWLMLMASLYITGAALYAARIPERFFPGKCD 

IWFHSHQLFfflFWAGAFVHFHGVSNLQEFRFMI 

GGGCSEEDAL 


3921 


A 


1587 


452 


LERDGCGGEEGGSVRSGAGPDSDPRGASSPPAG 

HRGTAASPRPVAAPSRTPAPPHTRARASPGLPSG 

PAWRRVQWFSRVSGQVSTLMKATVLMRQPGRV 

QEIVGALRKGGGDRLQVISDFDMTLSRFAYNGK 

RCPSSYNILDNSKIISEECRKELTALLHHYYPIEID 

PHRTVKEKLPHMVEWWTKAHNLLCQQKIQKFQI 

AQWRESNAMLREGYKTFFNTLYHNNIPLFIFSA 

GIGDILEEIIRQMKVFHPNIHIVSNYMDFNEDGFL 

QGFKGQLfflTYNKNSSACENCGYFQQLEGKTNV 

ILLGDSIGDLTMADGVPGVQNILKIGFLNDKVEE 

RRERYMDSYDIVLEKDETLDVVNGLLQHILCQG 

VQLEMQGP 


3922 


A 


2 


164 


GKIYQRAFGGHSLKFGKGVQAHGCCCVADRTG 
HSBLHTSYGRERPAPVHLRQDT 


3923 


A 


2 


3258 


EHATHAYAKLGTRRRHREVTVFVPTWQLKK'NR 

RVRESHFLTKLHSLKMLSITPSQLENGKKJTTYD 

YRFMVKLAEETDGIIVTNEQmiLMNSSKKLNlVK 

DRLLPFTFAGNLFMVPDDPLGRDGPTLDEFLKKP 

NRLDTDIGNFLK V WKTLPPSSAS VTEL SDDADSG 

PLESLPNMEEVREEKEERQDEEQRQGQGTQKAA 

EEDDLDSSLASVFRVECPSLSEEILRCLSLHDPPD 

GALDIDLLPGAASPYLGIPWDGKAPCQQVLAHL 

AQLTIPSNFTALSFFMGFMDSHRDAIPDYEALVG 

PLHSLLKQKPDWQWDQEHEEAFLALKRALVSAL 

CLMAPNSQLPFRLEVTVSHVALTAILHQEHSGRK 

HPIAYTSKPLLPDEESQGPQSGGDSPYAVAWALK 

HFSRCIGDTPWLDLSYASRTTADPEVREGRRVS 

KAWLIRWSLLVQDKGKRALELALLQGLLGENRL 

LTPAASMPRFFQVLPPFSDLSTFVCIHMSGYCFYR 
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1 SEQID 
NO: 


Method 


Predicted 

beginning 

nucleotide 

toeation 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
Jocatioji 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A'^AIanine C^ysteine, D=:Aspartic Acid, 
E=Glutamlc Acid, F=Phcnylalanine, G^lyclne, H=Histidine, 
l^lsoleudne, K»Lysine, I^Leucine, MBMetfaionine, 
N^Asparagine, P^Proline, Q=GIntamine, R=Arginine, S=Serine, 
T=Threonine, V=»ValIne, W=Tryptophan, Y=Tyrosinc, 
X=Unknown, *=^top codon, /^possible nucleotide deletion, 
\spossible nucleotide insertion 










EDEWCAGFGLYVLSPTSPPVSLSFSCSPYTPTYA 

HLAAVACGLERFGQSPLPWFLTHCNWIFSLLWE 

LLPLWRARGFLSSDGAPLPHPSLLSYIISLTSGLSS 

LPFIYRTSYRGSLFAVTVDTLAKQGAQGGGQWW 

SLPKDVPAPTVSPHAMGKRPNLLALQLSDSTLAD 

IIARLQAGQKLSGSSPFSSAFNSLSLDKESGLLMF 

KGDKKPRVWVVPTQLRRDLIFSVHDIPLGAHQR 

PEETYKKLRLLGWWPGMQEHVKDYCRSCLFCIP 

RNLIGSELKVIESPWPLRSTAPWSNLQIEWGPVT 

ISEEGHKHVLIVADPNTRWVEAFPLKPYTHTAVA 

QVLLQHVFARWGVPVRLEAAQGPQFARHVLVS 

CGLALGAOVASLSRDLOFPCLTSSGAYWEFKRA 

LKEFIFLHGKKWAASLPLLHLAFRASSTDATPFIC 

VLTGGESRLTEPLWWEMSSANIEGLKMDVFLLQ 

LVGELLELHWRVADKASEKAENRRFKRESQEKE 

WNVGDQVLLLSLPRNGSSAKWVGPFYIGDRLSL 

SLYRIWGFPTPEKLGCIYPSSLMKAFAKSGTPLSF 

KVLEQ 


3924 


A 


1 


1826 


MGSVTVRYFCYGCLFTSATWTVLLFVYFNFSEV . 

TQPLKm^VKGSGPHGPSPKKFYPRFTRGPSRVL 

EPQFKANKIDDVIDSRVEDPEEGHLKFSSELGMIF 

NERDQELRDLGYQKHAFNMLISDRLGYHRDVPD 

TRNAACKEKFYPPDLPAASWICFYNEAFSALLR 

TVHSVIDRTPAHLLHEIELVDDDSDFDDLKGELDE 

YVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATG 

EVLVFLDSHCEVNVMWLQPLLAAIREDRHTVGC 

PVmnSADTLAYSSSPWRGGFNWGLHFKWDLV 

PLSELGRAEGATAPIKSPTMAGGLFAMNRQYFH 

ELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 

CSRVGHIFRKRRPYGSPEGQDTMTHNSLRLAHV 

WLDEYKEOYFSLRPDLKTKSYGNISERVELRKKL 

GCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNR 

GPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKG 

GLVVLKACDYSDPNQIWIYNEEHELVLNSLLCLD 

MSETRSSDPPRLMKCHGSGGSQQWTFGKNNRLY 

QVSVGQCLRAVDPLGQKGSVAMAICDGSSSQQ 

WHLEG 


3925 


A 


5386 


2897 


VRWNSKTECYLSIQTQENFPANLNELVNCIVISSL 

VTTQRKLKAMSLLGSRNQLARAVLNPNPMDFCT 

KDLLTTTSERIIAYLRDFNEDQKKAIETAYAMVK 

HSPSVAKICLmGPPGTGKSKTIVGLLYRLLTENQ 

RKGHSDENSNAKIKQNRVLVCAPSNAAVDELM 

KKIILEFKEKCKDKKNPLGNCGDINLVRLGPEKSI 

NSEVXKJ^SLDSQVNHRMKKELPSHVQAMHKRK 

EFLDYQLDELSRQRALCRGGREIQRQELDENISK 

VSKERQELASKIKEVQGRPQKTQSaiLESHIICCT 

LSTSGGLLLESAFRGQGGVPFSCVIVDEAGQSCEI 

ETLTPLIHRCNKLILVGDPKQLPPTVISMKAQEYG 

YDQSMMARFCRLLEENVEHNMISRLPILQLTVQ 

YRMHPDICLFPSNYVYNRNLKTNRQTEAIRCSSD 

WPFQPYLVFDVGDGSERRDNDSYINVQEIKLVM 

EIIKLIKDKiaCDVSFRMGnTHYKAQKTN^ 

DKEFDRKGPAEVDTVDAFQGRQKDCVrVTCVRA 

NSIQGSIGFLASLQRLNVTITRAKYSLFILGHLRTL 

MENQHWNQLIQDAQKRGAIIKTCDKNYRHDAV 
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SEQ ID 


Method 


Predicted 

Deginning 

nncleotide 

location 

corresponding 

to first amino 

add residue of 

peptide 

sequence 


Predicted end 
nucicouae 
location 
corresponding 
to last amino 
acid residue of 
peptide 
sequence 


Amino acid sequence (A=='Alanine C==Cysteine, fNAspartic Acid, 
j&>^iuianiic ACiOi r-'menyinianine, v^=\yiycinei ti^Histidlne, 
]«JsoIeacIne, K-Lysine^ X^Leuclne, M^Methioniney 
Ni^Asparagine, F=Proline, Q=Glutaniine, R<^Aiginioe, S»Serine, 
T=Threonine, V^Valinc, W=Tryptopi»an, Y=Tyrosinc, 
X^Unlinoifvn, *-Stop codon, /=possible nucleotide deletion, 
V^possible nucleotide insertion 










KIUCLKPVLQRSLTHPPTIAPEGSRPQGGLPSSKL 

DSGFAKTSVAASLYHTPSDSKEITLTVTSKDPERP 

PVHDQLQDPRLLKRMGBEVKGGIFLWDPQPSSPQ 

HPGATPPTGEPGFPVVHQDLSHVQQPAAWAAL 

SSHKPPVRGEPPAASPEASTCQSKCDDPEEELCH 

RREARAFSEGEQEKCGSETHHTRRNSRWDKRTL 

EQEDSSSKKRXLL 


3926 


A 


99 


284 


MPREDRATWKSNYFLKJIQLLDDYPKRFIVGANN 
VGSKQMQQIRMSLRGKAVVLMGKNTMMR 


3927 


A 


542 


2 


AHLLMLNLAL\TDLL\YLTSLPFLIHYYASGENWI 
FGDFMCKFIRPSFHFNLYSSILFLTCFSIFRYCVIIH 
PMSCFSIHKTRCAWACAVVWnSLVAVIPMTFLI 
TSTNRTNRSACLDLTSSDELNTIKWYNLILTA\LL 
CLPLVIVTLCYmiHTLTHGHAN\DSCLKQKARR 
LTILLL 


3928 


A 


1 


1516 


GEEAVGGGAEGGGFGVGAQGRAGGRGVEAGR 

MRLSKTLVDMDMADYSAALDPAYTTLEFENVQ 

VLTMGNDTSPSEGTNLNAPNSLGVSALCAICGDR 

ATGKHYG AS SCDGCKGFFRRSVRKNHMYSCRFS 

RQCVVDKDKRNQCRYCRLKKCFRAGMKKEAV 

QNERDRISTRRSSYEDSSLPSINALLQAEVLSRQIT 

SPVSGINGDIRAKKIASIADVCESMKEQLLVLVE 

WAKYIPGFCELPLDDQGALLRAHAGEHLLLGAT 

KRSMVFKDVLLLGNDYIVPRHCPELAEMSRVSIR 

ILDELVLPFQELQIDDNEYAYLKAHFFDPDAKGL 

SDPGKIKRLRSQVQVSLEDYINDRQYDSRGRFGE 

LLLLLPTLQSITWQNflEQIQFIKLFGMAKIDNLLQ 

EMLLGGSPSDAPHAHHPLHPHLMQEHMGTOVIV 

A>rrMPTjHLSNGQMCEWPRPRGQAATPETPQPSP 

PGASGSEPYKLLPGAVATIVKPLSAIPQPTITKQE 

VI 


3929 


A 


1 


2782 


RVLSLESPLEKDPRVLGAQSVPRGRALKGLSPLG 

LDSAFRLFPDPRAGPWNTAVLSSGMEPETALWG 

PDLQGPEQSPNDAHRGAESENEEESPRQESSGEEI 

IMGDPAQSPESKDSTEMSLERSSQDPSVPQNPPTP 

LGHSNPLDHQIPLDPPAPEVVPTPSDWTKACEAS 

WQWGALTTWNSPPVVPANEPSLRELVQGRPAG 

AEKJPYICNECGKSFSQWSKLLRHQRIHTGERPNT 

CSECGKSFTQSSHLVQHQRTHTGEKPYKCPDCG 

KCFSWSSNLVQHQRTHTGEKPYKCTECEKAFTQ 

STNLIKHQRSHTGEKPYKCGECRRAFYRSSDLIQ 

HQATHTGEKPYKCPECGKRFGQNHNLLKHQKIH 

AGEKPYRCTECGKSFIQSSELTQHQRTHTGEKPY 

ECLECGKSFGHSSTLIKHQRTHLREDPFKCPVCG 

KTFTLSATLLRHQRTHTGERPYKCPECGKSFSVS 

SNLINHQRJHRGERPYICADCGKSFIMSSTLIRHQ 

RIHTGEKPYKCSDCGKSFIRSSHLIQHRRTHTGEK 

PYKCPECGKSFSQSSNLITHVRTHMDENLFVCSD 

CGKAFLEAHELEQHRVIHERGKTPARRAQGDSL 

LGLGDPSLLTPPPGAKPHKCLVCGKGFNDEGIFM 

QHQRIHIGENPYKNADGLIAHAAPKPPQLRSPRL 

PFRGNSYPGAAEGRAEAPGQPLKPPEGQEGFSQR 

RGLLSSKTYICSHCGESFLDRSVLLQHQLTHGNE 

KPFLFPDYRLGLGEGAGPSPFLSGKPFKCPECKQS 

FGLSSELLLHQKVHAGGKSSHKSPELGKSSSVLL 
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SEQ ID 
NO: 


Method 


Predietcd 

bcEiDnins 

nucleotide 

location 

corresponding 

to first amino 

acid residue of 

peptide 

sequence 


Predicted end 
nucleotide 
locadon 
corresponding 
to last amino 
acid residue of 
peptide 
seqaence 


Amino add sequence <A«Alanlne C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F^Phcnylalanine, G^^Glycine, He^Histidine, 
I=Isoleuctne, K==Ly$ine, L=Leudne, M^Metbionine, 
N-Asparagine, P=ProUnc, Q^GIutamine, R^Arginine, S^Serine, 
T^Threonine, V-Valinc, W=Tryptophan, Y-Tyrosine, 
X»Unknown, *BStop codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion 










EHLRSPLGARPYRCSDCRASFLDRVALTRHQETH 

TQEKPPNPEDPPPEAVTLSTDQEGEGETPTPTESS 

SHGEGQNPKTLVEEKPYLCPECGAGFTEVAALLL 

HRSCHPGVSL 


3930 


A 


513 


273 


KTQEllIIYISEHIFFPFLQGFGNLPICMAKTDLSLS 

HQPDKKGVPSDFILPISDVRASIGAGFIYPLVGTG 

SRESPLWL 


3931 


A 


16 


305 


KRRDFLSCWPAFTVLGEARGDQVDWSKLYRDT 
GLVKMSRKPRASSPFSNNHPSTPKRRGRGKHPLI 
PGPEALSKFPRQPIREKGPVKEVPGTKGSP 


3932 


A 


16 


305 


KRRDFLSCWPAFTVLGEARGDQVDWSKLYRDT 
GLVKMSRKPRASSPFSNNHPSTPKRRGRGKHPLI 
PGPEALSKFPRQPIREKGPVKEVPGTKGSP 


3933 


A 


1 


1546 


STHASEHWDSALQLAKHLAPDQIPFISKEYAIQLE 

FAGDYVNALAHYEKGITGDNKEHDEACLAGVA 

QMSIRMGDIRRGVNQALKHPSRVLKRDCGAILE 

NMKQFSEAAQLYEKGLYYDKAASVYIRSKNWA 

KVGDLLPHVSSPKIHLQYAKAKEADGRYKEAW 

AYENAKQWQSVIRIYLDHLNNPEKAVNIVRETQ 

SLDGAKMVARFFLQLGDYGSAIQFLVMSKCNNE 

AFTLAQQHNKMEIYADnCSEDTTNEDYQSIALY 

FEGEKRYLQAGKFFLLCGQYSRALKHFLKCPSSE 

DNVAIEMAIETVGQAKDELLTNQLIDHLLGEND 

GMPKDAKYLFRLYMALKQYKEAAQTAIIIAREE 

QSAGNYRNAHDVLFSMYAELKSQKIKIPSEMAT 

NLMILHSYILVKIHVKNGDHMKGARMLIRVANN 

ISKFPSHIVPILTSTVIECHRAGLKNSAFSFAAML 

MRPEYRSKroAKYKKKffiGNT^nRRPDISEIEEATTP 

CPFCKFLLPESELL 


3934 


A 


334 


1268 


PTRRPILPLTSPKAISVPSPLQGKQHIXVKSCLSVS 

GIGGFLVSLSSRMKLQTLAVSVTALKFWSAYVP 

CQTQDRDALRLTLEQIDLIRRMCASYSELELVTS 

AKALNDTQKLACLIGVEGGHSLDNSLSILRtFYM 

LGVRYLTLTHTCNTPWAESSAKGVHSFYNNISGL 

TDFGEKVVAEMNRLGMMVDLSHVSDAVAKRAL 

EVSQAPVIFSHSAARGVCNSARNVPDDILQLLEE 

ERWAFVMVSLFHGELIQWQPIRPMCSTVADHFD 

HIKAVUGSKFIGIGGDYDGAGKYRKKTTCKAPW 

RTSSRMSS 


3935 


A 


1 


S83 


HETTPAVVQSVLLERGWNKFDKQEQNAEDWNL 

YWRTSSFRMTEHNSVKPWQQLNHHPGTTKLTR 

KDCLAKHLKHMRRMYGTSLYQFIPLTFVMPNDY 

TKFVAEYFQERQMLGTKHSYWICKPAELSRGRG 

ILIFSDFKDFIFDDMYIVQKYISNPLLIGRYKCDLR 

lYVCVTGFKPLTIYVYQEGLVRFATEKFDLSNLQ 

NNYAHLTNSSINKSGASYEKIKEVIGHGCKWTLS 

RFFSYLRSWDVDDLLLWKKIHRMVILTILAIAPS 

VPFAANCFELFGFDILIDDNEFHRTG 


3936 


A 


203 


441 


HLAHSLGPLPKHYQYCVRYLYYQVTKDVIKEFA 
DDGVKYLELRSTPRRENATGMTKKTYVESILEGI 
KQSKQENLDIDV 
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SEQ ID NO: 


Position of end of 
Signal in Amino Acid 
Sequence 


MaxS (MAXIMUM 
SCORE) 


MeanS (Mean Score) 


1 


19 


0.930 


0.680 


2 


24 


0.964 


0.863 


3 


21 


0.990 


0.901 


4 


19 


0.981 


0.942 


5 


22 


0.991 


0.928 


6 


21 


0.956 


0.843 


8 


22 


0.913 


0.718 


9 


17 


0.997 


0.969 


11 


19 


0.930 


O.680 


13 


36 


0.983 


0.863 


14 


28 


0.935 


0.839 


15 


21 


0.997 


0.955 


16 


16 


0.983 


0.944 


17 


18 


0.989 


0.884 


19 


49 


0.996 


0.719 


20 


28 


0.972 


0.920 


21 


23 


0.954 


0.905 


22 


46 


0.955 


0.568 


23 


26 


0.942 


0.654 


24 


19 


0.979 


0.941 


25 


34 


0.884 


0.565 


26 


33 


0.934 


0.584 


27 


17 


0.975 


0.914 


28 


18 


0.980 


0.934 


29 


23 


0.928 


0.718 


30 


26 


0.978 


0.885 


32 


20 


0.946 


0.719 


33 


29 


0.933 


0.671 


35 


25 


0.996 


0.920 


36 


26 


0.903 


0.579 


40 


19 


0.981 


0.942 


47 


25 


0.971 


0.909 


53 


22 


0.991 


0.928 


55 


24 


0.960 


0.808 


60 


19 


0.986 


0.967 


78 


22 


0.913 J 


0.718 


86 


20 


0.883 


0.555 


87 


24 


0.982 


0.889 


88 


17 


0.997 


0.969 


115 


19 


0.930 


0.680 


134 


36 


0.983 


0.863 


136 


17 


0.913 


0.696 


137 


19 


0.958 


0.905 


140 


28 


0.935 


0.839 


143 


32 


0.914 


0.740 


153 


21 


0.997 


0.955 


154 


25 


0.913 


0.583 


155 


29 


0.972 


0.857 


169 


30 


0.977 


0.817 


170 


30 


0.977 


0.819 


171 


30 


0.977 


0.819 


175 


47 


0.926 


0.606 


176 


30 


0.968 


0.872 


177 


22 


0.957 


0.791 


192 


43 


0.930 


0.678 
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S£Q ID NO: 


Position of end of 
Signal in Amino Acid 
Sequence 


MaxS (MAXIMUM 
SCORE) 


MeanS (Mean Score) 


195 


19 


0.956 


0.860 




21 


0.982 


0.871 


203 


24 


0.957 


0.870 


207 


23 


0.954 


0.905 


224 


46 


0.955 


0.568 


225 


26 


0.942 


0.654 


228 


45 


0.961 


0.839 


231 


28 


0.994 


0.937 


232 


28 


0,993 


0.896 


234 


19 


0.979 


0.942 


235 


19 


0.979 


0.941 


238 


20 


0.987 


0.943 


244 


23 


0.929 


0.683 


250 


34 


0.884 


0.565 


256 


33 


0.934 


0.584 


258 


25 


0.934 


0.729 


259 


22 


0.969 


0.871 


264 


19 


0.952 


0.753 


265 


17 


0.975 


0.914 


266 


17 


0.975 


0.914 


271 


23 


0.974 


0.884 


274 


13 


0.971 


0.834 


275 


18 


0.980 


0.934 


278 


32 


0.958 


0.668 


280 


24 


0.966 


0.881 


281 


24 


0.966 


0.881 


286 


23 


0.928 


0.718 


291 


35 


0.991 


0.824 


293 


27 


0.956 


0.806 


294 


23 


0.952 


0.827 


301 


26 


0.978 


0.885 


316 


20 


0.946 


0,719 


320 


28 


0.978 


0.726 


327 


29 


0.933 


0.671 


331 


48 


O.903 


0,571 


345 


25 


0.996 


0.920 


349 


26 


0.903 


0.579 


351 


24 


0.951 


0.876 


352 


18 


0.944 


0.716 


353 


32 


0.992 


0.854 


354 


27 


0.945 


0.817 


355 


16 


0.922 


0.716 


356 


13 


0.959 


0.818 


357 


23 


0.986 


0.878 


358 


19 


0.904 


0.671 


359 


16 


0.988 


0.951 


360 


15 


0.981 


0.938 


361 


18 


0.944 


0.716 


362 


21 


0.984 


0.869 


363 


40 


0.979 


0.813 


364 


18 


0.883 


0.693 


365 


22 


0.962 


0.908 


366 


22 


0.961 


0.827 


367 


44 


0.941 


0.624 


368 


20 


0.952 


0.791 


369 


22 


0.949 


0.840 


370 


28 


0.957 


0.682 
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SEQ ID NO: 


Position of end of 


MaxS (MAXIMUM 


MeanS (Mean Score) 


Signal in Amino Acid 
Sequence 


SCORE) 




372 


28 


0.974 


0.894 


373 


19 


0.972 


0.947 


374 


29 


0.968 


0.785 


375 


19 


0,949 


0.897 


371 


23 


0.962 


0.910 


378 


31 


0.974 


0.895 


379 


26 


0.969 


0.939 


3S0 


27 


0.945 


0.817 


383 


27 


0.945 


A A 1 

0.817 


384 


25 


0.992 


A A ^1^^ 

0.877 


385 


32 


0.983 


0.825 


386 


44 


0.924 


0.564 


387 


26 


0.971 


0.894 


388 


19 


0.989 


0.862 


389 


24 


0.990 


0.947 


390 


34 


0.942 


0.635 


391 


16 


0.922 


0.716 


394 


19 


0.987 


0.970 


398 


36 


0.992 


0.866 


404 


13 


0.959 


0.818 


417 


23 


0.986 


0.878 


421 


19 


0.904 


0.671 


425 


28 


0.971 


0.717 


431 


16 


0,988 


0.951 


452 


18 


0.944 


0.716 


459 


21 


0.991 


0.902 


468 


21 


0.984 


0.869 


478 


40 


0.979 


0.813 " 


486 


18 

* 


0,883 


0.693 


499 


22 


0.962 


0.908 


501 


19 


0.962 


0.877 


514 


44 


0.941 


0.624 


529 


20 


0.952 


0.791 


533 


39 


0.914 


0.719 


548 


28 


0.957 


0.682 


561 


28 


0,974 


0.894 


562 


28 


0.974 


0.893 


564 


18 


0.949 


0.806 


576 


19 


0.972 


0.947 


584 


29 


0.968 


0.785 


585 


28 


0.973 


0.810 


591 


19 


0.949 


0.897 


592 


24 


0.991 


0.954 


594 


20 


0.985 


0.959 


595 


20 


0.985 


0.959 


612 


23 


0.962 


0.910 


619 


31 


0.974 


0.895 


621 


15 


0.959 


0.795 


633 


26 


0.969 


A AO A 

0.939 


640 


20 


0,949 


0.842 


645 


25 


0.911 


0.759 


684 


25 


0.992 


\i.ofl 


691 


32 


0.983 


0.825 


698 


44 


0.924 


0.564 


700 


19 


0.982 


0.941 


710 


26 


0.971 


0.894 


714 


23 


0.965 


0.907 



474 



wo 01/57190 



PCT/USOl/04098 



SEQ ID NO: 


Position of end of 
Signal in Amino Acid 
Sequence 


MaxS (JMAXIMUM 
SCORE) 


MeanS (Mean Score) 


718 


19 


0.989 


0.862 


725 


21 


0.976 


0.851 


728 


33 


0.961 


0.895 


734 


25 


0.963 


0.660 


741 


34 


0.942 


0.635 


744 


19 


0.959 


0.924 


747 


16 


0.922 


0.716 


756 


26 


0.973 


0.864 


767 


22 


0.986 


0.943 


768 


27 . 


0.916 


0.758 


769 


19 


0.987 


0.970 


770 


22 


0.981 


0.933 


771 


34 


0.993 


0.893 


773 


20 


0.968 


0.939 


774 


21 


0.971 


0.945 


778 


22 


0.986 


0.943 


779 


32 


0.973 


0.846 


781 


23 


0.950 


0.857 


785 


27 


0.916 


0.758- 


786 


27 


0.916 


0.758 


788 


22 


0.981 


0.933 


793 


22 


0,986 


0.803 


794 


39 


0.892 


0.654 


797 


27 


0.965 


0.847 


810 


22 


0.981 


0.933 


823 


34 


0.993 


0.893 


825 


17 


0,962 


0.778 


837 


20 


0.968 


0.939 


844 


25 


0.984 


0.951 


845 


17 


0.919 


0.706 


846 


21 


0.971 


0.945 


847 


21 


0.971 


0.945 


890 


22 


0.986 


0.943 


893- 


24 


0,971 


0.865 


894 


24 


0.971 


0.865 


896 


32 


0.973 


0.846 


899 


31 


0.982 


0.817 


922 


15 


0.882 


0.706 


924 


21 


0.975 


0.948 


925 


21 


0.927 


0.661 


933 


20 


0.967 


0.906 


960 


20 


0.967 


0.906 


967 


38 


0.970 


0.784 


968 


47 


0.970 


0.557 


972 


36 


0.945 


0.775 



TABLES 



SEQ 

ID 

NO: 


Method 


Predicted 
beginning 
nucleotide 
location 

corresponding to 
first amino acid 
residue of 
peptide sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C-Cysteine» D=Aspartic 
Acid, E^Glutamic Acid, F^Phenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, K^Lysine, Lr^Leucine, 
M=Methionine, N-Asparagine, P=Proiine, Q==Glutaniine, 
R»Arginine, S=Serine, T=Threonine, V«Valine, 
W=Tryptophan, Y=Tyrosine, X=Unltnown, *=Stop codon, 
/^possible nucleotide deletion, \-possible nucleotide 
insertion 


3955 


A 


235 


1272 


GPREVLAASSLADGSEEQVMAVALVRERDLSFPG * 
VGDAVVNPTRWHLPAQPEMLYEGGEGRMETLK 
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SEQ 
ID 

NO: 


Method 


Predicted 
Degin fling 
nucleotide 
location 

corresponding to 
first amino acid 
residue of 
peptide sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
add residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine C=Cysteine, ]>=4spartic 
ACiQ) vviutaniic /\ciat r— jrnenyiaianine, Lv— ijiycine^ 
H~Histidine, I=Isoleucine, K=Lysine, L=Leucine, 
M'sMethiionine, N=Asparagine, P=Proline, Q^'Glutamine, 
R=Arginine, S=Serine, T-Threonlne, V=Valine, 
W*=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \-possible nucleotide 
insertion 






- 




DKTLQELEELQNDSEAIDQLALESPEVQDLQLERE 

MALATNRSLAERNLEFQGPLEISRSNLSDRYQELR 

KLVERCQEQKAKLEKFSSALQPGTLLDLLQVEGM 

KmEESEAMAEKFLEGEVPLETFLEI^FSSMRMLSH 

LRRVRVEKLQEVVRKPRASQELAGDAPPPRSPPP 

V/PPSPPGNTPCG*RAAAATISHASLPFALQPIPQPA 

CGPHCPWSPATGPFPSSVPALLLQRASGPHLPGSP 

AWTQGCCGLLLVPTEEHAAPPYGFPPPPGPAWPG 

Y 


3956 


A 


821 


385 


SICADRTERVGIFFYIPAGTTDEADVTHP*EGHSYL 
SNHAGIQRSSRP/SHYQGE/WHDNCFTADELQLLT 
YQLCHTYVRCTRSVSIPAPAYYAHLVAFRARYHL 
VDKEHDSAEGSHVSGQSNGRDPQALAKAVQfflQ 

DTLRTMYFA' 


3957 


A 


4621 


240 


ELISTFKLLLEKKRSEVMKMKKRYEVGLEKLDSA 

SSQVATMQMELEALHPQLKVASKEVDEMMIMIE 

KESVEVAKTEKIVBCADETIANEQAMASKAIKDEC 

DADLAGALPILESALAALDTLTAQDITWKSN4KSP 

PAGVKLVMEAICILKGIKADKffDPTGSGKiaEDF 

WGPAKRLLGDMRFLQSLHEYDKDNffPAYMNUR 

KNYIPNPDFVPEKIRNASTAAEGLCKWVIAMDSY 

DKVAKIVAPKJGKLAAAEGELKIAMDGLRKKQA 

ALKEVQDKLARLQDTLELNKQKKADLENQVDLC 

SKXLERAEQLIGGLGGEKTRWSHTALELGQLYIN 

LTGDILISSGWAYLGAFTSTYTtQNQTKEWll'LCK 

GRDIPCSDDCSLMGTLGEAVTIRTWNIAGLPSDSF 

SIDNGIIIMNARRWPLMIDPQSQANKWDCNMEKA 

NSLYVIKLSEPDYVRTLENCIQFGTPVLLENVGEE 

LDPILEPLLLKQTFKQGGSTCIRLGDSnEYAPDFR 

FYITTKLRNPHYLPETSVKVTLLNFMITPEGMQDQ 

LLGIVVAQERPDLEEEKQALILQGAEMCRQLKEIE 

DKILEVLSSSEGNILEDETAIKILSSSKALANEISQK 

QEVAEETEKKIDTTRMGYRPIAfflSSILFFSLADLA 

NIEPMYQYSLTWFINLFILSIENSEKSEILAKRLQIL 

KDHFl YSL YVN VCRSLFEKDKLLFSFCLTIMLLLH 

ERAINKAEWRFLLTGGIGLDNPYANPCTWLPQKS 

WDEICRLDDLPAFKTIRREFMRLKDGWKXVYDSL 

EPHHEVFPEEWEDKANEFQRMLIIRCLRPDKVIPM 

LQEFIINRLGRAFIEPPPFDLAKAFGDSNCCAPLIFV 

LSPGADPMAALLKFADDQGYGGSKLSSLSLGQGQ 

GPIAMKMLEKAVKEGTWVVLQKCHLATSWMPT 

LEKVCEELSPESTHPDFRMWLTSYPSPNFPVSVLQ 

NGVKMTNEAPKGLRANIIRSYLMDPISDPEFFGSC 

KKPEEFKKLLYGLCFFHALVQERRKFGPLWWNIP 

YEFNETDLRISVQQLHMFLNQYEELPYEALRYMT 

GECNYGGRVTDDWDRRTLRSILl^KFFNPELVENS 

DYKFDSSGIYFVPPSGDHKSYIEYTKTLPLTPAPEI 

FGMNANADITKDQSETQLLFDNILLTQSRSAGAG 

AKSSDEVVNEVASDaCKLPNNFDIEAAMRRYPT 

TYTQSMNTVLVQEMGIU^NKLLKriRDSCVNIQICA 

JKGLAVMSTDLEEWSSn-NVKIPEMWMGKLSYPS 

LKPLGSYVNDFLARLKFLQQWYEVGPPPVFWLSG 

FFFTQAFLTGAQQNYARKYTIPIDLLGFDYEVMED 

KEYKHPPEDGVFIHGLFLDGASWNRKIKKLAESH 

PKILYDTVPVMWLKPCKRADPKRPSYVAPLYKT 
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SEQ 

ID 

NO: 


Method 


Predicted 
beginning 
nucleotide 
location 

corresponding to 
first amino acid 
residue of 
peptide sequence 


Predicted end 
nucleotide 
location 
corresponoing 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=AlanineO=»Cysteine, D=Aspartlc 
Acid, E=Glutamic Acid, l<--rnenyialanine, G==Giycine, 
H=Histidine, I=Isoleucine, K^Lysine, L=Leucine, 
IVl=I^ethionine. N=AsDaraEine. F=Protine. O^Gliitamine. 
R-Arginine, S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *~Stop codon, 
/^possible nucleotide deletion, V=possible nucleotide 
insertion 










SERRGVLSTTGHSIWVIA\MTLPSDQPKEHWIGR 
GVALLCQLNS 


3958 


A 


35 


529 


GADMAKSKNHTTHNQSRKWHRNVIKKPLSQRYK 

SLKGVDPKFLGl^CFTKKHKKXGLKKMQADSA 

KAVSTCAKAIEALVKPKEVKPKIPKGVSCELN*LA 

YIAYPKFWTCACACIAKGLRLCQPKAKAQDQTK 

AQVQIKAQAAAPASVPTQAPKGAQAPTKASG 


3959 


A 

• 


1S83 

■ 


763 


LLVLLLRTNLLIASSTRISRATLTCSPPGIPVDPRVR 

PRVRSHLVMYLGITTGSLHKAVVSGDSSAHLVEEI 

QLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPR 

ANCSVYESCVDCVLARDPHCAWDPESRTCCLLSA 

PNLNSWKODMERGNPEWACASGPMSRSLRPOSR 

PQUKEVLAVPNSILELPCPHLSALASYYWSHGPAA 

VPEASSTVYNGSLLLIVQDGVGGLYQCWATENGF 

SYPVISYWVDSQDQTLALDPELAGIPREHVKVPLT 

RVSGGAAIAAQQSYWPHFVTVTVLFALVLSGALI 

ILVASPLRALRARGKVQGCETLRPGEKAPLSREQH 

LQSPKECRTSASDVDADNNCLGTEVA 


3960 


A 


1 


481 


SYAAPSLFVKSLYWALAFMAVLLAVSGVVIVVLA 

SRAGARCQQCPPGWVLSEEHCYYFSAEAQAWEA 

SQAFCSAYHATLPLLSHTQDFLGRYPVSRHSWVG 

AWRGPQGWHWIDEAPLPPQLLPEDGEDNLDINCG 

ALEEGTLVAANCSTPRPWVCAKGTQ 



TABLE 9 



SEQ ID NO: 


Accession 
Number 


Species 


Description 


Smith 

Waterman 

Score 


Vo Idenity 


3937 


Y27700 


Homo sapiens 


Human secreted 
protein encoded by 
gene Mo. 12. 


193 


25 


3938 


AF093097 


Homo sapiens 


putative RNA-binding 
protein Q99 


3881 


84 


3939 


ABO 12308 


Anthocidaris 
crassispina 


B2HC 


4169 


74 


3940 


U 10248 


Homo sapiens 


ribosomal protein L29 


787 


95 


3941 


Y99418 


Homo sapiens 


Human PRO 13 17 
(UNQ783) amino acid 
sequence SEQ ID 
KO:277. 


4031 


100 


3942 


AL023516 


Gallus gallus 


B locus C type Lectin 


198 


35 



TABLE 10 



SEQ ID 
NO: 


Accession No. 


Description 


Results* 


3937 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 9.168e-l I 209- 

224 


3942 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 6.4O0e-l 137- 
55 



* Results Include in order: accession number subtype; raw score; p-value; position of signature in amino acid 
sequence 
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TABLE 11 



SEQID 
NO: 


PFAM Name 


Description 


P-Value 


PFAM 
Score 


3938 


Piwi 


Piwi domain 


2.6e-150 


512.7 


3940 


Ribosomal L29e 


Ribosomal L29e protein family 


2.3e-19 


77.8 


3941 


Sema 


Sema dojnain 


4e-181 


615.1 


3942 


lectin c 


Lectin C-type domain 


0.086 


-7.1 



5 

TABLE 12 





$£Q ID NO: 


Position of end of 
Signal in Amino Acid 
Sequence 


MaxS (Maximum Score) 


Means (Mean Score) 




3941 


31 


0.985 


0.926 




3942 


21 


0.974 


0.894 


10 






TABLE 13 





SEQ ID NO: 
of full length 
nucleotide 
sequence 


SEQID 
NO: of full 
length 
peptide 
sequence 


SEQ ID NO: 
of contig 
nucleotide 
sequence 


SEQ ID NO: 
of contig 
peptide 
sequence 


Priority Docket 
number 

corresponding SEQ 
D) NO: in priority 
application 


SEQ ID NO: in 
USSN 09/496,914 


3937 


3943 


3949 


3955 


787CIP2G_l 


787 3587 


3938 


3944 


3950 


3956 


787CIP2G 2 


787 3813 


39^9 


3945 


3951 


3957 


787CIP2G 3 


787_4462 


3940 


3946 


3952 


3958 


787CIP2G 4 


787 4887 


3941 


3947 


3953 


3959 


787CIP2G 5 


787 5794 


3942 


3948 


3954 


3960 


787CIP2G 6 


787 8743 



TABLE 14 



TISSUE ORIGIN 


LIBRARY/ 
RNA SOURCE 


HYS£Q LIBRARY 
NAME 


SEQ ID NOS: 


adult brain 


GIBCO 


ABD003 


3940 


adult brain 


Clontech 


ABR006 


3940 


adult brain 


Invitrogen 


ABR014 


3940 


cultured preadipocytes 


Strategene 


ADPOOl 


3937 


adult heart 


GIBCO 


AHROOl 


3940 


adult kidney 


GIBCO 


AKDOOl 


3940 


adult lung 


GIBCO 


ALGOOl 


3940 


young liver 


GIBCO 


/VLVOOl 


3940 


adult ovary 


Invitrogen 


AOVOOl 


3938,3940-3941 


adult spleen 


GIBCO 


ASPOOl 


3940-3941 


testis 


GIBCO 


ATSOOl 


3940 


bone marrow 


Clontech 


BMDOOl 


3938,3940 


bone marrow 


Clontech 


BMD004 


3940 


adult cervix 


BioChain 


CVXOOl 


3940 


endothelial cells 


Strategene 


EDTOOl 


3940 


fetal brain 


Clontech 


FBR006 


3940 


fetal brain 


Invitrogen 


FBr002 


3940-3941 


fetal heart 


Invitrogen 


FHROOl 


3940 


fetal kidney 


Clontech 


FKDOOl 


3940 


fetal kidney 


Clontech 


FKD0O2 


3940 
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TISSUE ORIGIN 


LIBRARY/ 
RNA SOURCE 


HYSEQ LIBRARY 
NAME 


SEQ ID NOS: 


fetal liver-spleen 


Columbia 
University 


FLSOOl 


3937, 3940 


fetal liver-spleen 

• 


Columbia 
University 


FLS002 


3938, 3941 


fetal liver-spleen 


Columbia 

University 


FLS003 


3940 


fetal liver 


Clontech 


FLV004 


3940 


fetal skin 


Invitrogen 


FSKOOl 


3940-3942 


fetal spleen 


BioChain 


FSPOOl 


3940 


fetal brain 


GIBCO 


HFBOOl 


3937, 3940-3941 


in&nt brain 


Columbia 
University 


IB2002 


3937, 3939,3941 


leukocyte 


GIBCO 


LUCOOl 


3940-3941 


leukocyte 


Clontech 


LUC003 


3940-3941 


melanoma from cell line ATCC 
#CRL 1424 


Clontech 


MEL004 


3940 


mammary gland 


Invitrogen 


MMGOOl 


3937, 3940-3941 


neuronal cells 


Strategene 


NTUOOl 


3937, 3942 


prostate 


Clontech 


PRTOOl 


3938 


rectum 


Invitrogen 


RECOOl 


3940 


salivary gland 


Clontech 


SALs03 


3941 


small intestine 


Clontech 


SINOOl 


3940 


skeletal muscle 


Clontech 


SKMOOl 


3940 


spinal cord 


Clontech 


SPCOOl 


3940 


thymus 


Clontech 


THMc02 


3938 


thyroid gland 


Clontech 


THROOl 


3942 


uterus 


Clontech 


UTROOl 


3940 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO:l-984, 1969-2952, 3937-3942 or 3949-3954, a full length protein 
coding portion of SEQ ID NO:l-984, 1969-2952, 3937-3942 or 3949-3954, a mature protein 
coding portion of SEQ ID NO: 1-984, 1969-2952, 3937-3942 or 3949-3954, an active domain 
coding portion of SEQ ID NO: 1-984, 1969-2952, 3937-3942 or 3949-3954, and complementary 
sequences thereof. 

2. An isolated polynucleotide encoding a polypeptide vsath biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide has greater than about 90^0 sequence identity with the polynucleotide of claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of clairn 1 . 

7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively 
associated with a regulatory sequence that modulates expression of the polynucleotide in the host 
cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 

(b) a polypeptide encoded by a polynucleotide hybridizing under stringent conditions 
with any one of SEQ ID NO: 1-984, 1969-2952, 3937-3942or3949-3954. 
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11. A composition comprising the polypeptide of claim 1 0 and a cairier. 

12. An antibody directed against the polypeptide of claim 10. 

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polynucleotide of claim 1 for a period sufficient to form the complex; and 

b) detecting the complex, so that if a complex is detected, the polynucleotide 
of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
nucleic acid primers that anneal to the poljmucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the polynucleotide of 

claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method 
further comprises reverse transcribing an annealed KNA molecule into a cDNA polynucleotide. 

16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to form the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 

17. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10 under 
conditions sufficient to form a polypeptide/compound complex; and 

b) detecting the complex, so that if the polypeptide/compound complex is 
detected, a compound that binds to the polypeptide of claim 10 is identified. 
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18. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10, in a cell, under 
conditions sufficient to forai a polypeptide/compound complex, wherein the complex drives 
expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 
of claim 10 is identified. 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected fromm 
the group consisting of SEQ ID NO: 1-984; 1969-2952, 3937-3942 or 3949-3954, a mature 
protein coding portion of SEQ ID NO: 1-984, 1969-2952, 3937-3942 or 3949-3954, an active 
domain coding portion of SEQ ID NO: 1-984, 1969-2952, 3937-3942 or 3949-3954, 
complementary sequences thereof and a polynucleotide sequence hybridizing under stringent 
conditions to SEQ ID NO: 1-984, 1969-2952, 3937-3942 or 3949-3954, under conditions 
sufficient to express the polypeptide in said cell; and 

b) isolating the polypeptide &om the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the group 
consistmg of any one of the polypeptides SEQ ID NO: 985-1968, 2953-3936, 3943-3948 or 

3955-3960, the mature protein portion thereof, or the active domain thereof. 

« 

2 1 . The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide array. 

22. A collection of polynucleotides, wherein the collection comprising the sequence 
infomiation of at least one of SEQ ID NO: 1-984, 1969-2952, 3937-3942 or 3949-3954. 

23. The collection of claim 22, wherein the collection is provided on a nucleic acid array. 

24. The collection of claim 23, wherein the array detects full-matches to any one of the 
polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any one of the 
polynucleotides in the collection. 
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26. The collection of claim 22, wherein the collection is provided in a computer-readable 
format. 

211 A method of treatment comprising administering to a mammalian subject in need thereof 
a therapeutic amount of a composition comprising a polypeptide of claim 10 or 20 and a 
pharmaceutically acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need thereof 
a therapeutic amount of a composition comprising an antibody that specifically binds to a 
polypeptide of claim 10 or 20 and a pharmaceutically acceptable carrier. 
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SEQUENCE LISTING 

<110> Hyseq, Inc . 

Tang, Y. Tom et al 

<120> Novel Nucleic Acids and Polypeptides 

<130> 21272-029 

<140> Not -Yet Assigned 
<141> 2001-02-05 

<150> Not Yet Assigned 
<151> 2001-01-30 

<150> 09/728,422 
<151> 2000-11-30 

<150> 09/693,325 
<151> 2000-XO-20 

<150> 09/663,561 
<151> 2000-09-15 

<150> 09/654,936 
<151> 2000-09-01 

<150> 09/620,325 
<151> 2000-07-19 

<15Q> 09/598,075 
<151> 2000-06-20 

<150> 09/560,875 
<151> 2000-04-27 

<150> 09/496,914 
<151> 2000-02-03 

<160> 3960 

<170> Custom 



<210> 1 

<211> 18-67 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (416) . . (814) 

<400> 1 

taccggtccg gaattcccgg gtcgacccac gcgtccggcc aaggttgcgg gtcatggagt 60 

cccgaaccct cctcctgctg ttctcgggag ccgtggccct gatccagacc tgggcagatt 120 

acaattacaa tcaaggcaga aatgatctca tttttacatt acaactctgg aaaaggcaat iso 

agactgagat gcaagtgtgc ccccaagtga tgggcagaag gagagaaggt gttttggatg 240 

cattctagaa cacaggtaat ctaaggagag ttgatcaagg ccgtggaaga gtcctccagc 300 
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cactattggc catcaaagga gtcctctgtg tcccaggaat ggtcctgctt tggtgtccct 360 

ggtgtgaccc atcacccgct gggaacagcc tgagagaagt agggcctctg caeca atg 418 

Met 
1 

ctg ctg agg atg tea gag cac agg aac gag gcc ttg gga aat tac ctg 466 
Leu Leu Arg Met Ser Glu His Arg Asn Glu Ala Leu Gly Asn Tyr Leu 

5 10 15 

gaa atg cga ctg aaa tct tec ttc ctg agg ggt ctg ggc tct tgg aaa 514 
Glu Met Arg Leu Lys Ser Ser Phe Leu Arg Gly Leu Gly Ser Trp Lys 
20 25 30 

tea aac cct etc agg ttg ggt ggc tgg acg att etc etc aca ett aca * 562 
Ser Asn Pro Leu Arg Leu Gly Gly Trp Thr lie Leu Leu Thr Leu Thr 
35 40 45 

atg gga eaa ggg gaa cea gga ggc ecc caa ggg gat cce tgg gtt cea 610 
Met Gly Gin Gly Glu Pro Gly Gly Pro Gin Gly Asp* Pro Trp Val Pro 
50 55 60 65 

cac gaa etc etc eta ccc tea ttg tgt gac age age cat gee tec tec 658 
His Glu Leu Leu Leu Pro Ser Leu Cys Asp Ser Ser His Ala Ser Ser 

70 75 80 

tgg gga tea gga tct att acc tgt gcc tgg aga gga ggg gac tec tct 706 
Trp Gly Ser Gly Ser lie Thr Cys Ala Trp Arg Gly Gly Asp Ser Ser 

85 SrO 95 

tct cac ceg ctg gtc tct gga cac ata ctg tec aat tee cct g'tg gqa 754 
Ser His Pro Leu Val Ser Gly His lie Leu Ser Asn Ser Pro Val Ala 
100 105 110 

get gta atg tgt agt tea atg ggc act eat ttg tec cct ttt aag ggt 802 
Ala Val Met Cys Ser Ser Met Gly Thr His Leu Ser Pro Phe Lys Gly 
115 120 125 

acc etc ctt tag aat ccaggacett ctaccctgca gagtgtggtt ttgggagaga 857 

Thr Leu Leu * 

130 



agtgcaaaat 


cccacgaeag 


gtgagttgaa 


ggaatgggat 


atggagccac 


atecacttcc 


917 


aecccttggt 


atctggaccc 


acgtgttctt 


ectaetgaga 


ttacagaact 


gtagagatgt 


977 


etttgatttt 


taaaatgcac 


catgtcctga 


aagatggcac 


cetcccaccc 


geagagtgct 


1037 


tectgeaagc 


tggcgttgag 

• 


ctsrtgcctat 


agaagctctt 


ttcaacattc 


tttatggcca 


1097 


geagecettg 


gttggtgcag 


atggtgatag 


gaccagtggg 


teccacagca 


tggccacact 


1157 


gcacctectt 


cgctgtcaag 


tgggtccccc 


aegaagatae 


tgcacggaga 


gcagtgecaa 


1217 


geetgtggat 


eaggaatatc 


aacagcccce 


agagagtggt 


gctggctgag 


ggtctgagag 


1277 


caggaeagga 


aaacccacet 


atggaatagg 


tgectatcce 


tgtgaagatg 


aacetetggc 


1337 


ccttceagga 


tggaaggagt 


gcaatgtagt 


eaacteetea 


ettagggtet 


ggttggtcac 


1397 


ctaaagaaat 


agagccctac 


cagggaagat 


cattgggttc , 


aaatgctgat 


gagtaggaca 


1457 


tttagaggtg 


gcagtgtctg 


gatetacett 


ggtaggaggg 


agtcagtact 


gttggaeeca 


1517 
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^■V.^N^ 


«k A* .^H 4* 

catccccgcc 


accgugtccg ccccacttat 


«A 

gca.cccaccc 


cacccggcct: 


1577 


gSfgctgaccc 


atggggaagg 


4^ ^v/v 4* ^ ^ 4* ^ ^ ^« ^ ^ M 

cuggccaauc ucagcgcccc 


tgcttggttg 


cucagggcca 


1637 


tttcaggttt 


gggtgttttc 


tggggatgtt aacatgggat 


tcaggctcaa 


ctcacaagaa 


1697 


acttttccat 


ctcatgatgg 


gatgctgttg ggcatgtcca 


atgtatgact 


tcatgagtta 


1757 


cacaggtgct 


aattcgtagg 


ggcacttggg atcacatggg 


tgttttttgt 


ccatggtcaa 


1817 


gcatctatct 


tatcagggct 


acagtaacat gcaaaagtgc 


ttccacattt 




1867 



<210> 2 

<211> 2586 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (2553) 

<400> 2 

aaa gca ata agg atg ttt aag tgc tgg tea gtt gtc ttg gtt etc gga 48 

Lys Ala lie Arg Met Phe Lys Cys Trp Ser Val Val Leu VaL Leu Gly 
15 10 15 

ttc att ttt ctg gag teg gaa gga agg cca acc aaa gaa gga gga tat 96 
Phe lie Phe Leu Glu Ser Glu Gly Arg Pro Thr Lys Glu Gly Gly Tyr 

20 25 30 

ggc ctt aaa tee tat cag cct eta atg aga ttg cga cat aag cag gaa 144 
Gly Leu Lys Ser Tyr Gin Pro Leu Met Arg Leu Arg His Lys Gin Glu 
35 40 -45 

aaa aat caa gaa agt tea aga gtc aaa gga ttt atg att eag gat gge 192 
Lys Asn Gin Glu Ser Ser Tbcg Val Lys Gly Phe Met lie Gin Asp Gly 
50 55 60 

cct ttg gga tct tgt gaa aat aag tac tgt ggt ttg gga aga cac tgt 240 
Pro Leu Gly Ser Cys Glu Asn Lys Tyr Cys Gly Leu Gly Arg His Cys 
65 70 75 80 

gtt gee age aga gag aca ggg caa gca gaa tgt gcc tgt atg gac ctt 288 
Val Ala Ser Arg Glu Thr Gly Gin Ala Glu Cys Ala Cys Met Asp Leu 

85 90 95 

tgc aaa cgt cac tac aga cct gtg tgt gga tct gac gga gaa ttc tat 336 
Cys Lys Arg His Tyr Arg Pro Val Cys Gly Ser Asp Gly Glu Phe Tyr 

100 105 110 

gaa aac cac tgt gaa gtg cac aga get get tge ctg aaa aaa caa aag 384 
Glu Asn His Cys Glu Val His Arg Ala Ala Cys Leu Lys Lys Gin Lys 
115 120 125 

att acc att gtt cac aat gaa gac tgc ttc ttt aaa gga gat aag tgc 432 
lie Thr lie Val His Asn Glu Asp Cys Phe Phe Lys Gly Asp Lys Cys 
130 135 140 

aag act act gaa tac age aag atg aaa aat atg eta tta gat tta caa 480 
Lys Thr Thr Glu Tyr Ser Lys Met Lys Asn Met Leu Leu Asp Leu Gin 
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145 150 155 160 

aat caa aaa tat att atg caa gaa aat gaa aat cct aat ggc gac gac 528 
Asn Gin Lys Tyr lie Met Gin Glu Asn Glu Asn Pro Asn Gly Asp Asp 

165 170 * 175 

ata tct egg aag aag eta ttg gtg gat caa atg ttt aaa tat ttt gat 576 
lie Ser Arg Lys Lys Leu Leu Val Asp Gin Met Phe Lys Tyr Phe Asp 

180 185 190 

gca gac agt aat gga ctt gta gat att aat gaa eta act cag gtg ata 624 
Ala Asp Ser Asn Gly Leu Val Asp lie Asn Glu Leu Thr Gin Val lie 
195 200 205 

aaa cag gaa gaa ctt ggc aag gat etc ttt gat tgt act ttg tat gtt 672 
Lys Gin Glu Glu Leu Gly Lys Asp Leu Phe Asp Cys Thr Leu Tyr Val 
210 215 220 

eta ttg aaa tat gat gat ttt aat get gac aag eac ctg get ctt gaa 720 
Leu Leu Lys Tyr Asp Asp Phe Asn Ala Asp Lys His Leu Ala Leu Glu 
225 230 235 240 

gaa ttt tat aga gca ttc caa gtg ate cag ttg agt ctg cca gaa gat 768 
Glu Phe Tyr Arg Ala Phe Gin Val lie Gin Leu Ser Leu Pro Glu Asp 

245 250 255 

cag aaa eta age ate act gca gca act gtg gga caa agt get gtt ctg 8X6 
Gin Lys Leu Ser lie Thr Ala Ala T2ir Val Gly Gin Ser Ala Val Leu 

260 265 270 

age tgt gcc att caa gga ace ctg aga cct ccc att ate tgg aaa agg 864 
Ser Cys Ala lie Gin Gly Thr Leu Arg Pro Pro lie lie Trp Lys Arg 
275 280 285 

aac aat att att eta aat aat tta gat ttg gaa gac ate aat gac ttt 912 
Asn Asn lie lie Leu Asn Asn Leu Asp Leu Glu Asp lie Asn Asp Phe 
290 295 300 

gga gat gat ggg tec ttg tat att act aag gtt acc aca act cac gtt 960 
Gly Asp Asp Gly Ser Leu Tyr lie Thr Lys Val Thr Thr Thr His Val 
305 310 315 320 

ggc aat tac acc tgc tat gca gat ggc tat gaa caa gtc tat cag act 1008 
Gly Asn Tyr Thr Cys Tyr Ala Asp Gly Tyr Glu Gin Val Tyr Gin Thr 

325 330 335 

cac ate ttc caa gtg aat gtt cct cca gtc ate egg gtg tat cca gag 1056 
His lie Phe Gin Val Asn Val Pro Pro Val lie Arg Val Tyr Pro Glu 

340 345 350 

agt cag get aga gag cct ggg gta act gcc agt ctt agg tgc cat gca 1104 
Ser Gin Ala Arg Glu Pro Gly Val Thr Ala Ser Leu Arg Cys His Ala 
355 360 365 

9^9 age ata cca aag cct cag ctt ggc tgg ttg aag aat gga att gat 1152 
Glu Gly lie Pro Lys Pro Gin Leu Gly Trp Leu Lys Asn Gly lie Asp 
370 375 380 

att aca cca aag ctt tec aaa caa etc aeg ctt caa gca aat ggc agt 1200 
He Thr Pro Lys Leu Ser Lys Gin Leu Thr Leu Gin Ala Asn Gly Ser 
385 390 395 400 

gag gtt cac ata age aat gtg cgc tat gaa gat act gga gca tac act 1248 
Glu Val His He Ser Asn Val Arg Tyr Glu Asp Thr Gly Ala Tyr Thr 
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405 410 415 

tgt ate gca aag aat gaa gca gga gtg tat gaa gac ate tct tct ctt 1296 

Cys lie Ala Lys Asn Glu Ala Gly Val Tyr Glu Asp He Ser Ser Leu 

420 425 430 

ttt gtg gaa gac tct get aga aag acc eta get aac ata tta tgg aga X344 

Phe Val Glu Asp Ser Ala Arg Lys Thr Leu Ala Asn He Leu Trp Arg 

435 440 445 

gaa gaa ggt etg gga att ggg aac atg ttc tat gtt ttt tat gaa gat X392 

Glu Glu Gly Leu Gly He Gly Asn Met Phe Tyr Val Phe Tyr Glu Asp 
450 455 460 

gga ate aaa gtg ata eaa eec ata gaa tgt gaa ttt cag agg cac att 1440 

Gly He Lys Val He Gin Pro He Glu Cys gIu Phe Gin Arg His He 
465 470 475 480 

aag cct agt gaa aag etc ctt gga ttt cag gat gaa gtc tgt ccc aaa 1488 

Lys Pro Ser Glu Lys Leu Leu Gly Phe Gin Asp Glu Val Cys Pro Lys 

485 490 495 

get gag gga gat gaa gtt cag agg tgt gtg tgg gca tea get gtt aat 1536 

Ala Glu Gly Asp Glu Val Gin Arg Cys Val Trp Ala Ser Ala Val Asn 

500 505 510 

gtc aaa gac aag ttc att tat gtt gca cag cca act ttg gac aga gtc 1584 

Val Lys Asp Lys Phe He Tyr Val Ala Gin Pro Thr Leu Asp Arg Val 

515 520 525 

ctt att gtt gat gtg cag tec caa aaa gtt gtt cag gca gtg age aca 1632 

Leu He Val Asp Val Gin Ser Gin Lys Val Val Gin Ala Val Ser Thr 
530 535 540 

gac cct gtc cca gtt aaa tta cac tat gac aaa tea cat gat cag gtc 1680 

Asp Pro Val Pro Val Lys Leu His Tyr Asp Lys Ser His Asp Gin Val 
545 550 555 560 

tgg gtg eta age tgg ggt acc ttg gag aag aca tea cca aca eta cag 1728 

Trp Val Leu Ser Trp Gly Thr Leu Glu Lys Thr Ser Pro Thr Leu Gin 

565 570 575 

gta att acc ctg gee agt ggg aat gtg cct cac cac acg ate cac acc 1776 

Val He Thr Leu Ala Ser Gly Asn Val Pro His His Thr He His Thr 

580 585 590 

caa cca gtg gga aag caa ttt gac aga gtg gat gat ttt ttc att ccc 1824 

Gin Pro Val Gly Lys Gin Phe Asp Arg Val Asp Asp Phe Phe He Pro 

595 600 605 

acc aca aca etc att ate acc cat atg agg ttt gga ttt att ctt cat 1872 

Thr Thr Thr Leu He He Thr His Met Arg Phe Gly Phe He Leu His 
610 615 620 

aaa gat gaa get gca eta caa aaa att gat ctt gaa ace atg tea tac 1920 

Lys Asp Glu Ala Ala Leu Gin Lys He Asp Leu Glu Thr Met Ser Tyr 
625 630 635 640 

ate aag aca att aac ttg aag gac tat aag tgc gtt cct cag tea ttg 1968 

He Lys Thr He Asn Leu Lys Asp Tyr Lys Cys Val Pro Gin Ser Leu 

645 650 655 

gca tat aca cac ttg gga gge tac tac ttc att ggc tgc aaa cct gac 2 016 

Ala Tyr Thr His Leu Gly Gly Tyr Tyr Phe He Gly Cys Lys Pro Asp 
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670 



age acc gga gca gtt tec cca cag gtc atg gtg gac ggt gta act gac 2064 
Ser Thr Gly Ala Val Ser Pro Gin Val Met Val Asp Gly Val Thr Asp 
675 680 685 

tea gtc att ggg ttc aat agt gat gtg acg ggc act cca tat gtc tct 2112 
Ser Val lie Gly Phe Asn Ser Asp Val Thr Gly Thr Pro Tyr Val Ser 
690 695 700 

cca gat ggc cac tac ctt gtc age att aat gat gtg aaa ggt ctt gta 2160 
Pro Asp Gly His Tyr Leu Val Ser He Asn Asp Val Lys Gly Leu Val 
705 710 715 720 

agg gtt cag tac att acc ate aga gga gaa ata cag gag get ttt gat 2208 
Arg Val Gin Tyr lie Thr lie Arg Gly Glu He Gin Glu Ala Phe Asp 

725 730 735 

att tac aca aat ctg cac ata tct gat ctg gca ttt caa cca tec ttt 2256 
He Tyr Thr Asn Leu His He Ser Asp Leu Ala Phe Gin Pro Ser Phe 

740 745 750 

act gaa gcc cac caa tat aac ate tac ggt agt tea age aca caa act 2304 
Thr Glu Ala His Gin Tyr Asn He Tyr Gly Ser Ser Ser Thr Gin Thr 
755 760 765 

gat gtg etc ttt gtg gag etc tct tct ggg aag gtc aag atg ata aag 2352 
Asp Val Leu Phe Val Glu Leu Ser Ser Gly Lys Val Lys Met He Lys 
770 775 780 

agt etc aag gaa cca etc aag gca gaa gaa tgg cct tgg aac egg aaa 2400 
Ser Leu Lys Glu Pro Leu Lys Ala Glu Glu Trp Pro Trp Asn Arg Lys 
785 790 795 800 

aac agg caa ate cag gac agt ggc ttg ttt ggt caa tac ctg atg aca 2448 
Asn Arg Gin He Gin Asp Ser Gly Leu Phe Gly Gin Tyr Leu Met Thr 

805 810 815 



cct tec aag gac tct etc ttc ate eta gat gga cga etc aat aaa tta 2496 

Pro Ser Lys Asp Ser Leu Phe He Leu Asp Gly Arg Leu Asn Lys Leu 

820 825 830 

aac tgt gag ate act gaa gtt gaa aaa gga aat aca gtc att tgg gtt 2544 

Asn Cys Glu He Thr Glu Val Glu Lys Gly Asn Thr Val He Trp Val 
835 840 845 

gga gat gcc taaaaaccct agggtacaat tattgaatga agg 2586 
Gly Asp Ala 
850 



<210> 3 

<211> 2856 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (182) . . (2593) 

<400> 3 
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aaattggtat agcagtaccc agctggctag cgtttaaact taagcttggt accgagctcg 60 

gatccactag tccagtgtgg tggaattccg cggctggagg tgtgaggatc cgaacccagg 120 

ggtggggggt ggaggcggct cctgcgatcg aaggggactt gagactcacc ggccgcacgc 180 

c atg agg gcc ctg tgg gtg ctg ggc etc tgc tgc gtc ctg ctg acc 226 
Met Arg Ala Leu Trp Val Leu Gly Leu Cys Cys Val Leu Leu Thr 
15 10 15 

ttc ggg teg gtc aga get gac gat gaa gtt gat gtg gat ggt aca gta 274 
Phe Gly Ser Val Arg Ala Asp Asp Glu Val Asp Val Asp Gly Thr Val 

20 25 30 

gaa gag gat ctg ggt aaa agt aga gaa gga tea agg acg gat gat gaa 322 
Glu Glu Asp Leu Gly Lys Ser Arg Glu Gly Ser Arg Thr Asp Asp Glu 

35 40 45 

gta gta eag aga gag gaa gaa get att cag ttg gat gga tta aat gca 370 
Val Val Gin Arg Glu Glu Glu Ala lie Gin Leu Asp Gly Leu Asn Ala 
50 55 60 

tea caa ata aga gaa ett aga gag aag teg gaa aag ttt gcc ttc caa 418 
Ser Gin lie Arg Glu Leu Arg Glu Lys Ser Glu Lys Phe Ala Phe Gin 
65 70 75 

gcc gaa gtt aac aga atg atg aaa ctt ate ate aat tea ttg tat aaa 466 
Ala Glu Val Asn Arg Met Met Lys Leu lie lie Asn Ser Leu Tyr Lys 
80 85 90 95 

aat aaa gag att ttc ctg aga gaa ctg att tea aat get tct gat get 514 
Asn Lys Glu lie Phe Leu Arg Glu Leu lie Ser Asn Ala Ser Asp Ala 

100 105 110 

tta gat aag ata agg eta ata tea etg act gat gaa aat get ctt tct 562 
Leu Asp Lys lie Arg Leu lie Ser Leu Thr Asp Glu Asn Ala Leu Ser 

115 120 125 

gga aat gag gaa eta aca gtc aaa att aag tgt gat aag gag aag aac 610 
Gly Asn Glu Glu Leu Thr Val Lys lie Lys Cys Asp Lys Glu Lys Asn 
130 135 140 

ctg ctg cat gtc aca gac acc ggt gta gga atg acc aga gaa gag ttg 658 
Leu Leu His Val Thr Asp Thr Gly Val Gly Met Thr Arg Glu Glu Leu 
145 150 155 

gtt aaa aac ctt ggt acc ata gcc aaa tct ggg aca age gag ttt tta 706 
Val Lys Asn Leu Gly Thr lie Ala Lys Ser Gly Thr Ser Glu Phe Leu 
160 165 170 175 

aac aaa atg act gaa gca cag gaa gat ggc cag tea act tct gaa ttg 754 
Asn Lys Met Thr Glu Ala Gin Glu Asp Gly Gin Ser Thr Ser Glu Leu 

180 185 190 

att ggc. cag ttt ggt gtc ggt ttc tat tec gcc ttc ctt gta gca gat 8 02 

lie Gly Gin Phe Gly Val Gly Phe Tyr Ser Ala Phe Leu Val Ala Asp 

195 200 205 

aag gtt att gtc act tea aaa cae aac aac gat acc eag cac ate tgg .850 
Lys Val lie Val Thr Ser Lys His Asn Asn Asp Thr Gin His lie Trp 
210 215 220 

gag tct gac tee aat gaa ttt tct gta att get gac eca aga gga aac 898 
Glu Ser Asp Ser Asn Glu Phe Ser Val lie Ala Asp Pro Arg Gly Asn 
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225 230 235 

act eta gga egg gga acg aca att acc ctt gtc tta aaa gaa gaa gca 946 
Thr Leu Gly Arg Gly Thr Thr lie Thr Leu Val Leu Lys Glu Glu Ala 
240 245 250 255 

tct gat tac ctt gaa ttg gat aca att aaa aat etc gtc aaa aaa tat 994 
Ser Asp Tyr Leu Glu Leu Asp Thr lie Lys Asn Leu Val Lys Lys Tyr 

260 265 . 270 

tea cag ttc ata aac ttt cot att tat gta tgg age age aag act gaa 1042 
Ser Gin Phe lie Asn Phe Pro lie Tyr Val Trp Ser Ser Lys Thr Glu 

275 260 265 

act gtt gag gag ccc atg gag gaa gaa gaa gca gee aaa gaa gag aaa 1090 
Thr Val Glu Glu Pro Met Glu Glu Glu Glu Ala Ala Lys Glu Glu Lys 
290 295 300 

gaa gaa tct gat gat gaa get gca gta gag gaa gaa gaa gaa gaa aag 1138 
Glu Glu Ser Asp Asp Glu Ala Ala Val Glu Glu Glu Glu Glu Glu Lys 
305 310 315 

aaa cca aag act aaa aaa gtt gaa aaa act gtc tgg gac tgg gaa ctt 1186 
Lys Pro Lys Thr Lys Lys Val Glu Lys Thr Val Trp Asp Trp Glu Leu 
320 325 330 335 

atg aat gat ate aaa cca ata tgg cag aga cca tea aaa gaa gta gaa 1234 
Met Asn Asp lie Lys Pro lie Trp Gin Arg Pro Ser Lys Glu Val Glu 

340 345 350 

gaa gat gaa tac aaa get ttc tac aaa tea ttt tea aag gaa agt gat 1282 
Glu Asp Glu Tyr Lys Ala Phe Tyr Lys Ser Phe Ser Lys Glu Ser Asp 

355 360 365 

gac ccc atg get tat att cac ttt act get gaa ggg gaa gtt acc ttc 1330 
Asp Pro Met Ala Tyr lie His Phe Thr Ala Glu Gly Glu Val Thr Phe 
370 375 380 

aaa tea att tta ttt gta ccc aca tct get cca cgt ggt etg ttt gac 1378 
Lys Ser lie Leu Phe Val Pro Thr Ser Ala Pro Arg Gly Leu Phe Asp 
385 390 395 

gaa tat gga tct aaa aag age gat tac att aag etc tat gtg egc cgt 1426 
Glu Tyr Gly Ser Lys Lys Ser Asp Tyr lie Lys Leu Tyr Val Arg Arg 
400 405 410 415 

gta ttc ate aca gac gac ttc eat gat atg atg ect aaa tac etc aat 1474 
Val Phe lie Thr Asp Asp Phe His Asp Met Met Pro Lys Tyr Leu Asn 

420 425 430 

ttt gtc aag ggt gtg gtg gac tea gat gat etc ccc ttg aat gtt tec 1522 
Phe Val Lys Gly Val Val Asp Ser Asp Asp Leu Pro Leu Asn Val Ser 

435 440 445 

egc gag act ett cag caa cat aaa ctg ett aag gtg att agg aag aag 1570 
Arg Glu Thr Leu Gin Gin His Lys Leu Leu Lys Val He Arg Lys Lys 
450 455 460 

ctt gtt cgt aaa acg ctg gac atg ate aag aag att get gat gat aaa 1618 
Leu Val Arg Lys Thr Leu Asp Met He Lys Lys He Ala Asp Asp Lys 
465 470 475 

tac aat gat act ttt tgg aaa gaa ttt ggt acc aac ate aag ctt ggt 16 66 

Tyr Asn Asp Thr Phe Trp Lys Glu Phe Gly Thr Asn He Lys Leu Gly 



8 



wo 01/57190 PCT/USOl/04098 

480 485 490 495 

gtg att gaa gac cac teg aat cga aca cgt ctt get aaa ctt ctt agg 1714 

Val lie Glu Asp His Ser Asn Arg Thr Arg Leu Ala Lys Leu Leu Arg 

500 505 510 

ttc cag tct tct cat cat cca act gac att act age eta gac cag tat 1762 

Phe Gin Ser Ser His His Pro Thr Asp lie Thr Ser Leu Asp Gin Tyr 

515 520 525 

gtg gaa aga atg aag gaa aaa caa gac aaa ate tac ttc atg get ggg . 1810 

Val Glu Arg Met Lys Glu Lys Gin Asp Lys lie Tyr Phe Met Ala Gly 
530 535 540 

tec age aga aaa gag get gaa tct tct cca ttt gtt gag cga ctt etg 1858 

Ser Ser Arg Lys Glu Ala Glu Ser Ser Pro Phe Val Glu Arg. Leu Leu 
545 550 555 

aaa aag ggc tat gaa gtt att tac etc aca gaa cct gtg gat gaa tac 1906 

Lys Lys Gly Tyr Glu Val lie Tyr Leu Thr Glu Pro Val Asp Glu Tyr 
560 565 570 575 

tgt att cag gcc ctt ccc gaa ttt gat ggg aag agg ttc cag aat gtt 1954 

Cys lie Gin Ala Leu Pro Glu Phe Asp Gly Lys Arg Phe Gin Asn Val 

580 585 590 

gcc aag gaa gga gtg aag ttc gat gaa agt gag aaa act aag gag agt 2002 

Ala Lys Glu Gly Val Lys Phe Asp Glu Ser Glu Lys Thr Lys Glu Ser 

595 600 605 

cgt gaa gca gtt gag aaa gaa ttt gag cct ctg ctg aat tgg atg aaa 2050 

Arg Glu Ala Val Glu Lys Glu Phe Glu Pro Leu Leu Asn Trp Met Lys 
610 615 620 

gat aaa gcc ctt aag gac aag att gaa aag get gtg gtg tct cag cgc 2 098 

Asp Lys Ala Leu Lys Asp Lys lie Glu Lys Ala Val Val Ser Gin Arg 
625 630 635 

ctg aca gaa tct ccg tgt get ttg gtg gcc age cag tac gga tgg tct 2146 

Leu Thr Glu Ser Pro Cys Ala Leu Val Ala Ser Gin Tyr Gly Trp Ser 
640 645 650 655 

ggc aac atg gag aga ate atg aaa gca caa gcg tac caa acg ggc aag 2194 

Gly Asn Met Glu Arg He Met Lys Ala Gin Ala Tyr Gin Thr Gly Lys 

660 665 670 

gac ate tct aca aat tac tat gcg agt cag aag aaa aca ttt gaa att 2242 

Asp He Ser Thr Asn Tyr Tyr Ala Ser Gin Lys Lys Thr Phe Glu He 

675 680 685 

aat ccc aga cac ccg ctg ate aga gac atg ctt cga cga att aag gaa 2290 

Asn Pro Arg His Pro Leu He Arg Asp Net Leu Arg Arg He Lys Glu 
690 695 700 

gat gaa gat gat aaa aca gtt ttg gat ctt get gtg gtt ttg ttt gaa 2338 

Asp Glu Asp Asp Lys Thr Val Leu Asp Leu Ala Val Val Leu Phe Glu 
705 710 715 

aea gca acg ctt egg tea ggg tat ctt tta cca gac act aaa gca tat 2386 

Thr Ala Thr Leu Arg Ser Gly Tyr Leu Leu Pro Asp Thr Lys Ala Tyr 
720 725 730 735 

gga gat aga ata gaa aga atg ctt cgc etc agt ttg aac att gae cct 2434 

Gly Asp Arg He Glu Arg Met Leu Arg Leu Ser Leu Asn He Asp Pro 
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740 745 750 

gat gca aag gtg gaa gaa gag cct gaa gaa gaa cct gaa gag aca gca 2482 
Asp Ala Lys Val Glu Glu Glu Pro Glu Glu Glu Pro Glu Glu Thr Ala 

755 760 765 

gaa gac aca aca gaa gac aca gag caa gac gaa gat gaa gaa atg gat 2530 
Glu Asp Thr Thr Glu Asp Thr Glu Gin Asp Glu Asp Glu Glu Met Asp 
770 775 780 

gtg gga aca gat gaa gaa gaa gaa aca gca aag gaa tct aca get gaa 2578 
Val Gly Thr Asp Glu Glu Glu Glu Thr Ala Lys Glu Ser Thr Ala Glu 
785 790 795 

aaa gat gaa ttg taa attatactct caeca tttgg atcctgtgtg gagagggaat 2633 

Lys Asp Glu Leu * 

800 

gtgaaattta catcatttct ttttgggaga gacttgtttt ggatgccccc taatcccctt 2693 

ctcccctgca ctgtaaaatg tgggattatg ggtcacagga' aaaagtgggt tttttagttg 2753 

aatttttttt aacattcctc atgaatgtaa atttgtacta tttaactgac tattcttgat 2813 

gtaaaatctt gtcatgtgta taaaaataaa aaagatccca aat 2856 



<210> 4 

<211> 415 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (249) 

<400> 4 

ttg egg aat teg gca cga ggc cga eta caa cag att gga gee atg get 48 

Leu Arg Asn Ser Ala Arg Gly A3:g Leu Gin Gin lie Gly Ala Met Ala 
15 10 15 

ttg gaa eag aac cag tea aca gat tat tat tat gag gaa aat gaa atg 96 
Leu Glu Gin Asn Gin Ser Thr Asp Tyr Tyr Tyr Glu Glu Asn Glu Met 

20 25 30 

aat ggc act tat gac tae agt caa tat gaa ctg ate tgt ate aaa gaa 144 
Asn Gly Thr Tyr Asp Tyr Ser Gin Tyr Glu Leu lie Cys lie Lys Glu 
35 40 45 

gat gtc aga gaa ttt gca aaa gtt ttc etc cct gta ttc etc aca ata 192 
Asp Val Arg Glu Phe Ala Lys Val Phe Leu Pro Val Phe Leu Thr lie 
50 55 60 

gtt ttc gtc att gga ctt gca ggc aat tec atg gta gtg gca att tat 240 
Val Phe Val lie Gly Leu Ala Gly Asn Ser Met Val Val Ala lie Tyr 
65 70 75 80 

gee aaa cat taaaata tctcgacccc taaaagttct gctcacagtc gttatagttt 296 
Ala Lys His 



tcattgteac tcaactgcct tataacattg tcaagttctg ccgagccata gacatcatct 356 
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actccctgat caccagctgc aacatgagca aacgcatgga catcgccatc caagtcaca 415 



<210> 5 

<211> 3190 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (653) . . (1072) 
<400> 5 

tccttgaccg gtccggaatt cccgggtcga cgatttcgtg gcggtagctg ctttccccac 60 

aaacccaaac tacatcccca aggtcagggc caggctctgt tgagtaatgc tccccatcac 120 

agcacgcagc acaaggctgg ctgtttgaca aatacacaca taaccaaaag acatgtgaat 180 

gtggtttccc aactcacttt aaacaaatat tccgggacca acttcaagga cgttctacaa 240 

cagaaactgg actgtactct tcacaaacat caatgtcatg aaagacagaa aggctgagga 3 00 

cctgctccaa acgaaaggag gctgcagaga ggacaactaa aggcagcgct tgatcctgga 360 

ttggatcctg gaccagaaaa acaacggctt tcaaaatcat tattgggaca gttggcaaat 420 

ttgaaacagg aactacaaat tagcaactcc tgggctgatc agagaatatt acacatttcc 480 

tgaatttgag aactgtacca tgggcatgtg agaggctgtc cctgttcttg ggaaatgttc 540 

actaaaacat cttacggata aacgagcatg atgtatacaa ataatcctca aatggttcat 600 

ggttcagaga ggataggaag aaacaggggt gagggagaag aggagggaac ga atg 655 

Met 
1 

aaa gag aaa atg tgg caa aat gtt ctt tgt tgt acc ctt caa act get 703 
Lys Glu Lys Met Trp Gin Asn Val beu Cys Cys Thr Leu Gin Thr Ala 

5 10 15 

gtg att ttg aaa tta ttt caa aat aaa gtt tta aat att tta aaa aat 751 
Val lie Leu Lys Leu Phe Gin Asn Lys Val Leu Asn He Leu Lys Asn 
20 25 30 

ttt ttt ctt tct ccc ctt gat acc agg aag aat aaa gtt ttt aaa aaa 799 
Phe Phe Leu Ser Pro Leu Asp Thr Arg Lys Asn Lys Val Phe Lys Lys 
35 40 45 

tgg gca ggt ggg cca ggc gcg gtg get cac gcc tgt aat ccc age act 847 
Trp Ala Gly Gly Pro Gly Ala Val Ala His Ala Cys Asn Pro Ser Thr 
50 55 60 65 

ttg gga ggc cga ggc ggg egg ate aca aag tea gga gat cga gac cat 895 
Leu Gly Gly Arg Gly Gly Arg He Thr Lys Ser Gly Asp Arg Asp His 

70 75 80 

• 

ect ggc caa eac ggt gaa acc egg tct eta eta aaa gta caa aaa att 943 
Pro Gly Gin His Gly Glu Thr Arg Ser Leu Leu Lys Val Gin Lys He 

85 90 95 
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age cag gtg tgg tgg cag atg act gta ggc caa get aat tgg gag get 991 

Ser Gin Val Trp Trp Gin Met Thr Val Gly Gin Ala Asn Trp Glu Ala 

100 105 110 

gag gca gga gaa tgg tgc gaa ccc ggg gag ggc aga get tgc agt gag 1039 

Glu Ala Gly Glu Trp Cys Glu Pro Gly Glu Gly Arg Ala Cys Ser Glu 

115 120 125 



cca aga tea ccc act gca etc cag act ggg taa cagagcaa gactcggtct 1090 
Pro Arg Ser Pro Thr Ala Leu Gin Thr Gly * 



130 


135 


140 








caaaaaaaat 


aataagttac 


atttaaatgt 


catatacata 


tttaagaaaa 


aaaaaaccaa 


1150 


gtacttctca 


tttaagacag 


agtagaaatt 


atttaaaatt 


aggagttggt 


gtaaaggatg 


1210 


agctacatat 


tcaagtcaaa 


ttatagtaag 


tattcactat 


tccactacca 


aagtaggtca 


1270 


attatactaa 


agagaagaaa 


tctatgtgaa 


ttgaggcatt 


ttctcacttt 


gatatatgtg 


1330 


aataaatttc 


aggttgtcta 


aattccbagg 


gttatatagt 


tagaaatata 


taattctctt 


1390 


atagacaggt 


caactagggg 


aaataagtta 


gcacaatcat 


ttgaattggt 


tgtctacata 


1450 


ctgggcaggg 


cttattcctt 


ttctttagct 


tctttgcaca 


tgtaaagcag 


gccataagat 


1510 


gtcctgtttt 


gccatggaca 


atgcaaccat 


ttttaggtcg 


accttgacaa 


atcacacaag 


1570 


gttcaatggc 


attaaggggc 


aaactagatt 


ccacactctc 


ttctttgtct 


tgggtttctt 


1630 


ccctttcaaa 


ctctttcaca 


tcttcttggc 


tgctataaat 


aatgctacta 


gaagttgatg 


1690 


gctgagaata 


gtcttcactt 


tcttgtgatt 


gtgaagcttg 


tgtaatttta 


tcatcatttt 


1750 


cctcaacaca 


tgactctctg 


gaatcattca 


ctatagtttt 


tttacaatca 


ggaacatcaa 


1810 


agccctcttc 


agcttgtgtt 


gagttttcca 


gtttggcttt 


ctcagagatt 


tcccctttat 


1870 


ctttcccttt 


atcttcagga 


agccaattct 


cacgaagggc 


ccaacatctg 


ttgcaatgtg 


1930 


atggaagggg 


gggattcatt 


tcattgcatg 


aagtgcattt 


ccaatagtca 


gctaaggaaa 


1990 


tttcaggatc 


ttcttcaaat 


gaatctgtat 


cactctcccc 


tgcctgatac 


acagtaactt 


2050 


gatatacctc 


atcatcttca 


tctgagagtt 


cttgtccttc 


ttcactaagg 


ctataatctt 


2110 


ctcracitccracr 


agattcaact 


tcaaattcta 


cactaaactcr 


atctoaaact 


caa tec teat 


2170 


ccaaccaatc 


acctgaatgt 


tcacttacac 


cagcatcaag 


atccggattc 


gatggcgtcc 


2230 


ctgtagattc 


actgctactg 


cttctttcac 


aacatatctc 


ccttattaca 


cacagagcca 


2290 


ggctttcatc 


aaaggaaagg 


gaaatactat 


cagatttgtg 


gcgttttctt 


tgtcgttcac 


2350 


cagataattc 


atctgaattt 


tcttctgtct 


cactaattgc 


tctccttcta 


gatgaggtag 


2410 


atggtctaga 


aaccaaatgt 


gaagatgaag 


gtttctcttc 


ctgaagctct 


tgtacaaggt 


2470 


ccttttgatc 


actcccacct 


tcaaggtgac 


acctgttctc 


actcacagat 


gtacctgagt 


2530 


ccgatgattc 


ctgctgattg 


actactacca 


agttcctgta 


gatcatggta 


tatattttcc 


2590 


tgtgctcttt 


cacagagaag 


cttggcacgc 


caaacaaatc 


tcctagaaga 


tcatttgaac 


2650 
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aatatacaat atgttgttgc ttctcatcat ataatcgttt agtcataata tactggccaa 2710 

gataaaaaag aacctctttc atagtataag tgtctttttg tgcaccaaca gactttaata 2770 

acttcaaaag caatggcttt ggtctaacca gggtctcttg ttccgaagct ggaatctgtg 2830 

aggtggttac agcaccatca gtaggtacag acatgttggt attgcacatt tgcctgctcc 2890 

tcaccatccg gggttttcgc gcttggagtc gggggtccct caagactccc cagtttcctt 2950 

cacggggact gactactgcg gaagcacgac gccctgggcc tcggggatca ttccactctc 3010 

cgggccaggg cactgggcgc tcgtacgcac taatccgggg agggacggtg ctcctggctg 3070 

cgaaagcagc aggatctcgg tcagaggggt cgcggccgcc cctcgggctc ggcttcttgc 3130 

tccatctttc cgacacacag ggccacacag gccccagaag cagccaagct cgcgcggtgc 3190. 



<210> 6 
<211> 1343 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (291) . . (1109) 
<400> 6 

gcggccgccc cggcggctcc tggaaccccg gttcgcggcg atgccagcca ccccagcgaa 60 

gccgccgcag ttcagtgctt ggataatttg aaagtacaat agttggtttc cctgtccacc 120 

cgccccactt cgcttgccat cacagcacgc ctatcggatg tgagaggaga agtcccgctg 180 

ctcgggcact gtctatatac gcctaacacc tacatatatt ttaaaaacat taaatataat 240 

taacaatcaa aagaaagagg agaaaggaag ggaagcatta ctgggttact atg cac 296 

Met His 
1 

ttg cga ctg att tct tgg ctt ttt ate att ttg aac ttt atg gaa tac 344 
Leu Arg Leu lie Ser Trp Leu Phe lie lie Leu Asn Phe Met Glu Tyr 
5 10 15 

ate ggc age caa aac gcc tec egg gga agg cgc cag cga aga atg cat 392 
lie Gly Ser Gin Asn Ala Ser Arg Gly T^g Arg Gin Arg Arg Met His 
20 25 30 

cct aac gtt agt caa ggc tgc caa gga ggc tgt gca aca tgc tea gat 440 
Pro Asn Val Ser Gin Gly Cys Gin Gly Gly Cys Ala Thr Cys Ser Asp 
35 40 45 50 

tac aat gga tgt ttg tea tgt aag ccc aga eta ttt ttt get ctg gaa 488 
Tyr Asn Gly Cys Leu Ser Cys Lys Pro Arg Leu Phe Phe Ala Leu Glu 

55 60 65 

aga att ggc atg aag cag att gga gta tgt etc tct tea tgt eca agt 536 
Arg lie Gly Met Lys Gin lie Gly Val Cys Leu Ser Ser Cys Pro Ser 

70 75 80 

gga tat tat gga act cga tat cea gat ata aat aag tgt aca aaa tgc 584 
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Gly Tyr Tyx Gly Thr Arg Tyr Pro Asp He Asn Lys Cys Thr Lys Cys 
85 90 95 

aaa get gac tgt gat acc tgt ttc aac aaa aat ttc tgc aca aaa tgt 632 
Lys Ala Asp Cys Asp Thr Cys Phe Asn Lys Asn Phe Cys Thr Lys Cys 
100 105 110 

aaa agt gga ttt tac tta cac ctt gga aag tgc ctt gac aat tgc cca 680 
Lys Ser Gly Phe Tyr Leu His Leu Gly Lys Cys Leu Asp Asn Cys Pro 
115 120 125 130 

gaa ggg ttg gaa gcc aac aac cat act atg gag tgt gtc agt att gtg 728 
Glu Gly Leu Glu Ala Asn Aan His Thr Met 61u. Cys Val Ser He Val 

135 140 145 

cac tgt gag gtc agt gaa tgg aat cct tgg agt cca tgc acg aag aag 776 
His Cys Glu Val Ser Glu Trp Asn Pro Trp Ser Pro Cys Thr Lys Lys 

150 155 160 

gga aaa aca tgt ggc ttc aaa aga ggg act gaa aca egg gtc cga gaa 824 
Gly Lys Thr Cys Gly Phe Lys Arg Gly Thr Glu Thr Arg Val Arg Glu 
165 170 175 

ata ata cag cat cct tea gca aag ggt aac eta tgt ccc cca aca aat 872 
He He Gin His Pro Ser Ala Lys Gly Asn Leu Cys Pro Pro Thr Asn 
180 185 190 

gag aca aga aag tgt aca gtg caa agg aag aag tgt cag aag gga gaa 920 
Glu Thr Arg Lys Cys Thr Val Gin Arg Lys Lys Cys Gin Lys Gly Glu 
195 200 205 210 

cga gga aaa aaa gga agg gag agg aaa aga' aaa aaa cct aat aaa gga 968 
Arg Gly Lys Lys Gly Arg Glu Arg Lys Arg Lys Lys Pro Asn Lys Gly 

215 220 225 

gaa agt aaa gaa gca ata cct gac age aaa agt ctg gaa tec age aaa 1016 
Glu Ser Lys Glu Ala He Pro Asp Ser Lys Ser Leu Glu Ser Ser Lys 

230 235 240 

gaa ate cca gag caa cga gaa aac aaa cag cag cag aag aag cga aaa 1064 
Glu He Pro Glu Gin Arg Glu Asn Lys Gin Gin Gin Lys Lys Arg Lys 
245 250 255 

gtc caa gat aaa cag aaa teg gta tea gtc age act gta cac tag agg 1112 
Val Gin Asp Lys Gin Lys Ser Val Ser Val Ser Thr Val His * • 
260 265 270 

gttccatgag attattgtag actcatgatg ctgetatcte aaccagatgc ccaggacagg 1172 

tgctctagcc attaggacca caaatggaca tgtcagttat tgctctgtct aaacaacatt 1232 

cccagtagtt getatattct teatacaagc atagttaaca acaaagagcc aaaagatcaa 12 92 

agaagggata ctttcagatg gttgtcttgt gtgcttctct gcatttttaa a 1343 



<210> 7 

<211> 973 

<212> DNA 

<213> Homo sapiens 
<220> 
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<221> CDS 

<222> (1) . . (795) 

<400> 7 

atg gat cca acc ate tea acc ttg gacaca gaa ctg aca cca ate aac 4 8 

Met Asp Pro Thr He Ser Thr Leu Asp Thr Glu Leu Thr Pro lie Asn 
15 10 15 

gga act gag gag act ctt tgc tac aag cag acc ttg age etc acg gtg ' 96 
Gly Thr Glu Glu Thr Leu Cys Tyr Lys Gin Thr Leu Ser Leu Thr Val 

20 25 30 

ctg acg tgc ate gtt tec ctt gtc ggg ctg aca gga aac gca gtt gtg 144 
Leu Thr Cys He Val Ser Leu Val Gly Leu Thr Gly Asn Ala Val Val 
35 40 45 

etc tgg etc ctg ggc tgc cgc atg egc agg aac gee ttc tec ate tac 192 
Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Phe Ser He Tyr 
50 55 60 

ate etc aac ttg gee gca gca gac ttc etc ttc etc age ggc cgc ctt 240 
He Leu Asn Leu TQa Ala Ala Asp Phe Leu Phe Leu S^r Gly Arg Leu 
65 70 75 80 

ata tat tec ctg tta age ttc ate agt ate ecc cat ace ate tct aaa 288 
He Tyr Ser Leu Leu Ser Phe He Ser He Pro His Thr He Ser Lys 

85 90 95 

ate etc tat cct gtg atg atg ttt tec tac ttt gca ggc ctg age atg 336 
He Leu Tyr Pro Val Met Met Phe Ser Tyr Phe Ala Gly Leu Ser Met 

100 105 110 

ctg age ace ate age acc gag eac cgc ctg tee gtc ctg tgg ecc ate 384 
Leu Ser Thr He Ser Thr Glu His Arg Leu Ser Val Leu Trp Pro He 
115 120 125 

tgg tac tgc tgc eac tgc ecc aca cac ctg tea geg gtc atg tgt gtc 432 
Trp Tyr Cys Cys His Cys Pro Thr His Leu Ser Ala Val Met Cys Val 
130 135 140 

ctg etc tgg gee ctg tec ctg ttg cag age ate ctg gag tgg atg ttc 480 
Leu Leu Trp Ala Leu Ser Leu Leu Gin Ser He Leu Glu Trp Met Phe 
145 150 155 160 

tgt age ttc ctg ttt agt gat gtt gac tct gat aat tgg tgt caa ata 528 
Cys Ser Phe Leu Phe Ser Asp Val Asp Ser Asp Asn Trp Cys Gin He 

165 170 175 

tta gat ttc etc aca gtc geg tgg ctg att ttt tta ate tgt ggt tct 576 
Leu Asp Phe Leu Thr Val Ala Trp Leu He Phe Leu He Cys Gly Ser 

180 185 190 

ctg tgg gtt eac cct ggt cct get gat cag gat cat atg tgg ate ccg 624 
Leu Trp Val His Pro Gly Pro Ala Asp Gin Asp His Met Trp He Pro 
195 2O0 205 

gaa gat ace get gac cag get gta tgt gac cat cct get eac agg get 572 
Glu Asp Thr Ala Asp Gin Ala Val Cys TVsp His Pro Ala His Arg Ala 
210 215 220 

ggt ctt cct cct ctg tgg cct gcc cct cag cat tea gtt ttt cct att 720 
Gly Leu Pro Pro Leu Trp Pro Ala Pro Gin His Ser Val Phe Pro He 

225 230 235 240 
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ata ctg gat cca cgt gga cag gga agt ctt att ttg tea tgt tea tct 768 
lie Leu Asp Pro Arg Gly Gin Gly Ser Leu lie Leu Ser Cys Ser Ser 

245 250 255 ■ 

agt ttc tat ttt cct gtc cgc tct taa cagca gtgccaaccc catcatttac 820 
Ser Phe Tyr Phe Pro Val Arg Ser * 

260 26S 

ttcttcgtgg gctcctttag gcagcgtcaa aataggcaga acctgaagct ggttctccag 880 

agggctctgc aggacgcgtc tgaggtggat gaaggtggag ggcagcttcc tgaggaaatc 940 

ctggagctgt cgggaagcag attggagcag tga 973 



<210> 8 
<211> 639 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<:222> (217) , . (453) 
<400> 8 

atttggccct cgaggccaag aattcggcac gaggcgcttt tgcaaagccg cggaggtgaa 60 

gtgaacttag aggttgtggg gccgaggggt cgtcttatag ctaccagccc acaggcattt 120 

agtctacgtt ggaggtaaac aaatacgggt cctgcttagg agaaaagaaa aacgtcttac 180 

agccagtgtc taaactccaa acaacggaat gtatca atg aga cct tgt ata tgg 234 

Met Arg Pro Cys lie Trp 
1 5 

ata cac gtg cat tta aaa ccg ccc tgc egg ctt gta gag ctt ttg ccg 282 
lie His Val His Leu Lys Pro Pro Cys Arg Leu Val Glu Leu Leu Pro 

10 15 20 

ttc tec age get tta eag ggg tta teg cac tta age etc gga aca act 330 
Phe Ser Ser Ala Leu Gin Gly Leu Ser His Leu Ser Leu Gly Thr Thr 
25 30 35 

tta cca gtg att eta cca gaa agg aat gaa gaa cag aac ctt cag gaa 378 
Leu Pro Val lie Leu Pro Glu Arg Asn Glu Glu Gin Asn Leu Gin Glu 
40 45 50 

ttg agt cac aat gca gac aaa tat caa atg gga gat tgt tgc aag gaa 426 
Leu Ser His Asn Ala Asp Lys Tyr Gin Met Gly Asp Cys Cys Lys Glu 
55 60 65 70 

gag att gat gat agt att ttc tac tag ccatt gggaagataa aaggagacag 478 
Glu He Asp Asp Ser He Phe Tyr * 

75 

aagattgaag cctttgccag ccattctttc cetttttgct tceaaactce tcaaetggga 538 

acctteatat gtgcagtatt tatattggat catactggtg attataaaag ttcctaggag 598 

gctagaagag ccaaecaaca gagaagggaa agcagtctgt t 639 
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<210> 9 

<211> 3522 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (416) . . (2353) 
<220> 

<221> misc_feature 
<222> (1) . . . (3522) 

<223> n s a/t,c or g 



<400> 

ccgagccgag 


9 

cgccgaggac 


cgtgctaccc caggcccggg 


ctcgcccagc 


gcaggctcct 


ctctggcagc 


agcggcggcg 


cggngaccGC ngtccctcgg 


cctccnnttc 


ccatcccacc 


tcccgagcct 


tcctcttccc 


gcagcacgcc cggcccggcc 


cggccgtggc 


cctcctcagt 


gccggccgcd: 


atggcagagg 


cgtccggcgc ggggaaaatc 


tagcccgggg 


atttcatgcg 


gcctagctcg 


gttccgcctc 


ctcctcgcgc ggccccagcg 


gctgcccgca 


ccccagcccc 


actccgggcc 


tccgtgtctc 


tcctgtgatc gcactgacac 


ggccgggggg 


ttagaatgga 


acaaactgaa 


ggcccgatga 


gagaaaggga aagttaagga 


tgctggagca 


gaaca atg 



Met 
1 



60 
120 
180 
240 
300 
360 
418 



gat ttc tct ttc tct ttc atg caa ggg ate atg gga aac aca att cag 466 
Asp Phe Ser Phe Ser Phe Met Gin Gly lie Met Gly Ajsn Thr lie Gin 

5 10 15 

caa cca cct caa etc att gac tec gcc aac ate egt cag gag gat gcc 514 
Gin Pro Pro Gin Leu lie Asp Ser Ala Asn. lie Arg Gin Glu Asp Ala 
20 25 30 

ttt gat aac aac agt gac att get: gaa gat ggt ggc cag aca cca tat 562 
Phe Asp Asn Asn Ser Asp lie Ala Glu Asp Gly Gly Gin Thr Pro Tyr 
35 40 45 

gaa get act ttg cag eaa ggc ttt cag tac cca get aca aca gaa gat 610 
Glu Ala Thr Leu Gin Gin Gly Phe Gin Tyr Pro Ala Thr Thr Glu Asp 
50 55 60 65 

ctt ect cca etc aca aat ggg tat cca tea tea ate agt gtg tat gaa 658 
Leu Pro Pro Leu Thr Asn Gly Tyr Pro Ser Ser lie Ser Val Tyr Glu 

70 75 80 

act caa ace aaa tac cag tea tat aat cag tat cct aat ggg tea gcc 706 
Thr Gin Thr Lys Tyr Gin Ser Tyr Asn Gin Tyr Pro Asn Gly Ser Ala 

85 90 95 

aat ggc ttt ggt gca gtt aga aac ttt age ccc act gac tat tat cat 754 
Asn Gly Phe Gly Ala Val Arg Asn Phe Ser Pro Thr Asp Tyr Tyr His 
100 105 110 

tea gaa att cca aac aca aga cca cat gaa att ctg gaa aaa cct tec 802 
Ser Glu He Pro Asn Thr Arg Pro His Glu He Leu Glu Lys Pro Ser 
115 120 125 
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cct cca cag cca cca cct cct cct teg gta cca caa act gtg att cca 850 

Pro Pro Gin Pro Pro Pro Pro Pro Ser Val Pro Gin Thr Val lie Pro 

130 135 140 145 

aag aag act ggc tea cct gaa att aaa eta aaa ata acc aaa act ate 898 

Lys Lys Thr Gly Ser Pro Glu lie Lys Leu Lys lie Thr Lys Thr He 

150 155 160 

cag aat ggc agg gaa ttg ttt gag tct tec ctt tgt gga gac ett tta 946 

Gin Asn Gly Arg Glu Leu Phe Glu Ser Ser Leu Cys Gly Asp Leu Leu 

165 170 175 

aat gaa gta cag gca agt gag cac acg aaa tea aag cat gaa age aga 994 

Asn Glu Val Gin Ala Ser Glu His Thr Lys Ser Lys His Glu Ser Arg 

180 185 190 

aaa gaa aag agg aaa aaa age aac aag eat gac tea tea aga tct gaa 1042 

Lys Glu Lys Arg Lys Lys Ser Asn Lys His Asp Ser Ser. Arg Ser Glu 

195 200 205 

gag cgc aag tea cae aaa ate cec aaa tta gaa cca gag gaa caa aat 1090 

Glu Arg Lys Ser His Lys He Pro Lys Leu Glu Pro Glu Glu Gin Asn 

210 215 220 225 

aga cca aat gag agg gtt gac act gta tea gaa aaa cea agg gaa gaa 1138 

Arg Pro Asn Glu Arg Val Asp Thr Val Ser Glu Lys Pro Arg Glu Glu 

230 235 240 

cca gta eta aaa gag gaa gcc cca gtt cag cea ata eta tct tct gtt 1186 

Pro Val Leu Lys Glu Glu Ala Pro Val Gin Pro lie Leu Ser Ser Val 

245 250 255 

cea aca acg gaa gtg tec act ggt gtt aag ttt cag gtt ggc gat ett 1234 

Pro Thr Thr Glu Val Ser Thr Gly Val Lys Phe Gin Val Gly Asp Leu 

260 265 270 

gtg tgg tec aag gtg gga acc tat cct tgg tgg cct tgt atg gtt tea 1282 

Val Trp Ser Lys Val Gly Thr Tyr Pro Trp Trp Pro Cys Met Val Ser 

275 280 285 

agt gat cec cag ctt gag gtt cat act aaa att aac aca aga ggt gcc 1330 

Ser Asp Pro Gin Leu Glu Val His Thr Lys He Asn Thr Arg Gly Ala 

290 295 300 305 

ega gaa tat cat gtc cag ttt ttt age aac cag cea gag agg geg tgg 1378 

Arg Glu Tyr His Val Gin Phe Phe Ser Asn Gin Pro Glu Arg Ala Trp 

310 315 320 

gtt cat gaa aaa egg gta cga gag tat aaa ggt cat aaa cag tat gaa 142 6 

Val His Glu Lys Arg Val Arg Glu Tyr Lys Gly His Lys Gin Tyr Glu 

325 330 335 

gaa tta ctg get gag gca acc aaa caa gcc age aat cac tct gag aaa 1474 

Glu Leu Leu Ala Glu Ala Thr Lys Gin Ala Ser Asn His Ser Glu Lys 

340 345 350 

caa aag att egg aaa cec cga cct cag aga gaa cgt get cag tgg gat 1522 

Gin Lys He Arg Lys Pro Arg Pro Gin Arg Glu Arg Ala Gin Trp Asp 

355 360 365 

att ggc att gcc cat gca gag aaa gca ttg aaa atg act cga gaa gaa 1570 

He Gly He Ala His Ala Glu Lys Ala Leu Lys Met Thr Arg Glu Glu 

370 375 380 385 



18 



wo 01/57190 



PCT/USOl/04098 



aga ata gaa cag tat act ttt att tac att gat aaa cag cct gaa gag 1618 
Arg lie Glu Gin Tyr Thr Phe He Tyr He Asp Lys Gin Pro Glu Glu 

390 395 400 

get tta tec caa gca aaa aag agt gtt gcc tec aaa acc gaa gtt aaa 1666 
Ala Leu Ser Gin Ala Lys Lys Ser Val Ala Ser Lys rhr Glu Val Lys 

405 410 415 

aaa acc cga cga eca aga tct gtg ctg aat act cag cca gaa cag acc 1714 
Lys Thr Arg Arg Pro Arg Ser Val Leu Asn Thr Gin Pro Glu Gin Thr 
420 425 430 

aat gca ggg gag gtg gcc tec tea etc tea agt act gaa att egg aga 1762 
Asn Ala Gly Glu Val Ala Ser Ser Leu Ser Ser Thr Glu He Arg Arg 
435 440 445 

eat age eag agg egg cac aca agt gcg gaa gag gaa gag cca eeg cct 1810 
His Ser Gin Arg Arg His Thr Ser Ala Glu Glu Glu Glu Pro Pro Pro 
450 455 460 465 

gtt aaa ata gcc tgg aaa act gcg gca gca agg aaa tec tta cca get 1858 
Val Lys ; He Ala Trp Lys Thr Ala Ala Ala T^g Lys Ser Leu Pro Ala 

470 475 480 

tec att aeg atg cac aaa ggg age ctg gat ttg cag aag tgt aac atg 1906 
Ser He Thr Met His Lys Gly Ser Leu Asp Leu Gla Lys Cys Asn Met 

485 490 495 

tct eca gtt gtg aaa att gaa caa gtg ttt get ctt cag aat get aca 1954 
Ser Pro Val Val Lys He Glu Gin Val Phe Ala Leu Gin Asn Ala Thr 
500 505 510 

ggg gat ggg aaa ttt ate gat caa ttt gtt tat tea aca aag gga att 2002 
Gly Asp Gly Lys Phe He Asp Gin Phe Val Tyr Ser Thr Lys Gly He 
515 520 525 

ggt aac aaa aca gaa ata agt gte agg ggg caa gac agg ctt ata att 2O50 
Gly Asn Lys Thr Glu He Ser Val Arg Gly Gin Asp Arg Leu He He 
530 535 540 545 

tct aca cca aac cag aga aat gaa aag cca acg cag agt gta tea tct 2098 
Ser Thr Pro Asn Gin Arg Asn Glu Lys Pro Thr Gin Ser Val Ser Ser 

550 555 560 

cct gaa gca aca tct ggt tct aca ggc tea gta gaa aag aag caa cag 2146 
Pro Glu Ala Thr Ser Gly Ser Thr Gly Ser Val Glu Lys Lys Gin Gin 

565 570 575 

aga aga tea att aga act egt tct gaa tea gag aaa tec act gag gtt 2194 
Arg Arg Ser He Arg Thr Arg Ser Glu Ser Glu Lys Ser Thr Glu Val 
580 585 590 

gtg cca aag aag aag ate aaa aag gag cag gtt gaa aca gtt cct cag 2242 
Val Pro Lys Lys Lys He Lys Lys Glu Gin Val Glu Thr Val Pro Gin 
595 600 605 

get aca gtg aag act gga tta cag aaa ggg. teg gcg gac egg gga gtg 2290 
Ala Thr Val Lys Thr Gly Leu Gin Lys Gly Ser Ala Asp Arg Gly Val 
610 615 620 625 

cag ggc tct gtc aga tte agt gac age tec gte tec gca gcg att gag 2338 
Gin Gly Ser Val Arg Phe Ser Asp Ser Ser Val Ser Ala Ala He Glu 

630 635 640 
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gaa act gtg gac tga gattcctgta caatttcatc ccagaaactc cagacttgta 2.393 
Glu Thr Val Asp * 

645 



gtctccatgc 


aagatttctt 


tgtcggcggc 


ttgataaaca gtttctttgt tttcgatttt 


2453 


gatttcgcca 


atcatcatta 


ttggcatttt 


cctgcctggt ttcttcttca agactctgaa 


2513 


caattgcttt 


aacagtcaaa 


tgattttttt 


ttttcggttt gagctggatg ggtacagctt 


2573 


aaatcatggg 


tccagcctaa 


aaaccaccat 


ttaacttaca ctgatcaatt tcaacatgga 


2633 


ctgtttttgg 


ttttttgttt 


tt-aaataaag 


catcattaat gcacatctgc aggggtttgc 


2693 


caaacagccc 


aaactgtata 


cattacaatc 


attaaaagtt cttatttttt ttaatattag 


2753 


tgccgttatc 


atggagaaca 


gcatgacagc 


tgtctttggc agtctgtcat ttttctagca 


2813 


ttttcagaaa 


ctcatcggaa 


atggcggtac 


ctgtgtttcc cttcgaaagc ctctcagtac 


2873 


agcactcctg 


ttcctctgtt 


aaaactcctt 


gttaatccag tgatctttta ggccaaggaa 


2933 


atattttgtg 


atggtgttct 


gggtccatac 


accagcaat'g aaggagatag atttgtgtac 


2993 


ttgtgttttt 


taatcagcat: 


taacatgggc 


aggcaccctc atttatagat gtcaggaaac 


3053 


attcagtgaa 


aaacttgtag 


aatgggatgt 


gataacgagg ttccagtaat ctgagcagtc 


3113 


taacgaggcc 


cacctccbcc 


accacagaac 


gtggctatgt tccaagtgct actctcactc 


3173 


agcctgttgc 


ggatcttcat 


ggcctcagga 


gacttgtttc tccatgggct cttctggact 


3233 


gcacacttcc 


accatagctt 


gctgggttga 


tctagatgtc tgtttgttgt atggaaattt 


3293 


tgggggaaaa 


aatccaaaac 


ac'aaactgtg 


ggttgaaata ttaaccgtct ccttggttcc 


3353 


c ugg ua u u ca 


ccgcgcccga 


t" 4* ^^^3 ^ 3 ^ 

^Cugccicauw 


ucaucgcggc uguuucugua uagwv;uoti.ac 




tgcattagcc 


caagagattg 


ttgctttgta 


actttttgca ctattgtttt ggctggattt 


3473 


gtattacaca 


cagttttaaa 


aaaaacaatt 


ccacactaaa aaaaaaaaa 


3522 


<210> 
<211> 
<212> 


10 

2366 
DNA 









<213> Homo sapiens 

<220> 
<221> CDS 

<222> (174) . . (1541) 
<400> 10 

cctcagcctc cccgccgccg ccgccgccgc cgccgccgag ccggtttcct ttttccggcg 60 

ctccgggtgc gagagacagg tcgggccccc taggcagcga gccgcagcgc aatcccggcg 120 

ctcgcccaag gaccctggaa gctaccgtta ccccgccggg cagcgtgggc gcc atg 176 

Met 
1 
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age age teg gga ctg aat teg gag aag gta get get ctg at a cag aaa 224 
Ser Ser Ser Gly Leu Asn Ser Glu Lys Val Ala Ala Leu lie Gin Lys 

5 10 15 

ctg aat tec gac ccc cag ttc gta ctt gee cag aat gtc ggg acc ace 272 
Leu Asn Ser Asp Pro Gin Phe Val Leu Ala Gin Asn Val Gly Thr Thr 
20 25 30 

cac gac ctg ctg gac ate tgt ctg aag egg gee acg gtg cag cge gcg 320 
His Asp Leu Leu Asp lie Cys Leu Lys Arg Ala Thr Val Gin Arg Ala 
35 40 45 

eag cat gtg ttc cag cac gee gtg cce cag gag ggc aag cca ate aec 368 
Gin His Val Phe Gin His Ala Val Pro Gin Glu Gly Lys Pro lie Thr 
50 55 60 65 

aae cag aag age tea ggg cga tgc tgg ate ttt tct tgt ctg aat gtt 416 
Asn Gin Lys Ser Ser Gly Arg Cys Trp lie Phe Ser Cys Leu Asn Val 

70 75 80 

atg agg ctt cca ttc atg aaa aag tta aat att gaa gaa ttt gag ttt 464 
Met Arg Leu Pro Phe Met Lys Lys Leu Asn lie Glu Glu Phe Glu Phe 

85 90 95 

age caa tet tac ctg ttt ttt tgg gac aag gtt gaa cgc tgt tat ttc 512 
Ser . Gin Ser Tyr Leu Phe Phe Trp Asp Lys Val Glu Arg Cys Tyr Phe 
100 105 110 

ttc ttg agt get ttt gtg gac aca gee cag aga aag gag cct gag gat 560 
Phe Leu Ser Ala Phe Val Asp Thr Ala Gin Arg Lys Glu Pro Glu Asp 
115 120 125 

ggg agg ctg gtg cag ttt ttg ctt atg aac cct gca aat gat ggt ggc 608 
Gly Arg Leu Val Gin Phe Leu Leu Met Asn Pro Ala Asn Asp Gly Gly 
130" 135 140 145 

caa tgg gat atg ctt gtt aat att gtt gaa aaa tat ggt gtt ate cct 656 
Gin Trp Asp Met Leu Val Asn lie Val Glu Lys Tyr Gly Val lie Pro 

150 155 160 

aag aaa tgc ttc cct gaa tct tat aca aca gag gca acc aga agg atg 704 
Lys Lys Cys Phe Pro Glu Ser Tyr Thr Thr Glu Ala Thr Arg Arg Met 

165 170 175 

aat gat att ctg aat cac aag atg aga gaa ttc tgt ata cga ctg egg 752 
Asn Asp lie Leu Asn His Lys Met Arg Glu Phe Cys lie Arg Leu Arg 
180 185 190 

aac ctg gta cac agt gga gca acc aaa gga gaa ate teg gcc aca cag 800 
Asn Leu Val His Ser Gly Ala Thr Lys Gly Glu He Ser Ala Thr Gin 
195 200 205 

gac gtc atg atg gag gag ata ttc cga gtg gtg tgc ate tgt ttg ggt 848 
Asp Val Met Met Glu Glu He Phe Arg Val Val Cys He Cys Leu Gly 
210 215 220 225 

aat cca cca gag aca ttc ace tgg gaa tat cga gac aaa gat aaa aat 896 
Asn Pro Pro Glu Thr Phe Thr Trp Glu Tyr Arg Asp Lys Asp Lys Asn 

230 235 240 

tat cag aaa att ggc ccc ata aca cce ttg gag ttt tae agg gaa cat 944 
Tyr Gin Lys He Gly Pro He Thr Pro Leu Glu Phe Tyr Arg Glu His 

245 250 255 
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gtc aag cca etc ttc aat atg gaa gat aag att tgt tta gtg aat gac 992 

Val Lys Pro Leu Phe Asn Met Glu Asp Lys lie Cys Leu Val Asn Asp 
260 265 270 

cct agg ccc cag cac aag tac aac aaa ctt tac aca gtg gaa tac tta 1040 

Pro Arg Pro Gin His Lys Tyr Asn Lys Leu Tyr Thr Val Glu Tyr Leu 
275 280 285 

age aat atg gtt gga ggg aga aaa act eta tac aac aac cag ccc att 1088 

Ser Asn Met Val Gly Gly Arg Lys Thr Leu Tyr Asn Asn Gin Pro He 

290 295 300 305 



gac ttc ctg aaa aag atg gtt get gcc tec ate aaa gat gga gag get 1136 
Asp Phe Leu Lys Lys Met Val Ala Ala Ser lie Lys Asp Gly Glu Ala 

310 315 320 

9^9 tgg ttt ggc tgt gat gtt gga aaa cac ttc aat age aag ctg ggc 1184 
Val Trp Phe Gly Cys Asp Val Gly Lys His Phe Asn Ser Lys Leu Gly 

325 330 335 

etc agt gac atg aat etc tat gac cat gag tta gtg ttt ggt gtc tec 1232 
Leu Ser Asp Met Aisn Leu Tyr Asp His Glu Leu Val Phe Gly Val Ser 
340 345 350 

ttg aag aac atg aat aaa gcg gag agg ctg act ttt ggt gag tea ctt 1280 
Leu Lys Asn Met Asn Lys Ala Glu Arg Leu Thr Phe Gly Glu Ser Leu 
355 360 365 

atg acc cac gee atg ace ttc act get gtc tea gag aag gat gat cag 1328 
Met Thr His Ala Met Thr Phe Thr Ala Val Ser Glu Lys Asp Asp Gin 
370 375 380 385 

gat ggt get ttc aca aaa tgg aga gtg gag aat tea tgg ggt gaa gac 1376 
Asp Gly Ala Phe Thr Lys Trp Arg Val Glu Asn Ser Trp Gly Glu Asp 

390 395 40O 

cat ggc cac aaa ggt tac ctg tgc atg aca gat gag tgg ttc tct gag 1424 
His Gly His Lys Gly Tyr Leu Cys Met Thr Asp Glu Trp Phe Ser Glu 

405 410 415 

tat gtc tac gaa gtg gtg gtg gac agg aag cat gtc cct gaa gag gtg 1472 
Tyr Val Tyr Glu Val Val Val Asp Arg Lys His Val Pro Glu Glu Val 
420 425 430 

eta get gtg tta gag cag gaa ccc att ate ctg cca gca tgg gac ccc 1520 
Leu Ala Val Leu Glu Gin Glu Pro lie He Leu Pro Ala Trp Asp Pro 
435 440 445 

21 tg gga get ttg get gag tga ta ctgccctcca getctttcct ccttceatgg 1573 
Met Gly Ala Leu Ala Glu * 
450 455 

aacctgacgt agctgcaaag gacagatcca gggactgaag ccaaagttat gcaagggact 1633 

gtgtgttgcc acaggacaca gtcagattte cagtctccac caggaacctc ttcagaaagt 1693 

gtgctttatg ctgaaacaga atactgttaa aggaaaaaaa agagggggga agatcaggtc 1753 

ataetatcta ctctcctcat ctctaacagc teaggatctc ttagcatttt aattagatgt 1813 

aattgtttgt ctttaaetgt eaaaaagttt ggttetgtgt ctgtgtttta ataagacgag 1873 

aggacgagcg attgaggtgt atggagagaa aacagaccta atgctccttg ttcctagagt 1933 
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agagtggagg 


gagggtggcc 


taagagttga gctctcggaa 


ctgcatgctg 


ctggacagta 


1993 


tcactgtctt 


tcctagatgg 


cagtcactga attccatttt 


ttcaaggtaa 


tttcttgtgc 


2053 


ctctaatagc 


ccaagaatgg 


gaggttgatc agatctgaca 


tgattccttc 


ctgttctgaa 


2113 


ctgtggggtg 


cacatctctg 


cttgagtcag gtttgagtag 


aggcttagag 


acagttgggt 


2173 


gagaacaacc 


aaaatcttat 


catggtctca gtcataatca 


ttagggggaa 


ctctagccaa 


2233 


atggtttaac 


ttictgcctgt 


ggaactgggg attgggtggg 


caggaaaagg 


tgatatccat 


2293 


tctttctgat 


aactagatgg 


tgctgagaag cttttgaata 


aaaactttgc 


taaatgagaa 


2353 


taagctgaaa 


aaa 








2366 



<210> 11 

<211> 1732 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (241) . . (495) 

<400> 11 

gatgcctgat ctcatcaatc tagcgggaga gacaggataa cctgtccgag agtatagcgc 60 

cacttatgac tccgccggaa aaattacttt aaaaatcgcc aaaaattact tggagcaaag 120 

ggcagtccgg cggcgttcgc caaggtggcg cagtcggttt tgacctgtag cagagaacca 180 

attctggaga acagcctcac ttctttgatt gaatacttac ataatgcatt ggaacatgac 240 

atg aga tta agg ttt aat aat gat aga atg aag acc aca ata aaa gag 288 
Met Arg Leu Arg Phe Aszi Asn Asp Arg Met Lys Thr Thr lie Lys Glu 
15 10 15 

acc aca ate ctt age tea gca att ctt ace ttt ctt acc tat ttg atg 336 
Thr Thr lie Leu Ser Ser Ala lie Leu Thr Phe Leu Thr Tyx Leu Met 

20 25 30 

aag atg tct ttt gaa agg tgt act gca agg aac aaa atg ttt gta aat 384 
Lys Met Ser Phe Glu Arg Cys Thr Ala Arg Asn Lys Met Phe Val Asn 
35 40 45 

tct ccc ttt tac cca agg gtg gat aat tac tgt acc tec tea tgg aaa 432 
Ser Pro Phe Tyr Pro Arg Val Asp Asn Tyr Cys Thr Ser Ser Trp Lys 
50 55 60 

aag ttt tat tta aag tgt tat ttc tea ttg aat act ate aaa aag gaa 480 
Lys Phe Tyr Leu Lys Cys Tyr Phe Ser Leu Asn Thr lie Lys Lys Glu 
65 70 75 80 

aaa aaa atg ace taa acttttgaga tagatttggc tctagtaagt atttagttat 535 
Lys Lys Met Thr * 

85 

ateacttgca tatetgggag aagaaataag agactatcat cagtacattc ccatctacta 595 
aaaaaattta ttttacacat gtcaagggat tacttataac ttccatttta ttactaatag 655 
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cttgaaccct 


tttaatgaag 


acctaactcc tccaccagaa 


atttaagttt atgttcttac 


715 


tttgtttact 


tataaaatac 


atctcaggta tttcggatgt 


cttttttttt tctaagccta 


775 


tatgaaatga 


aaaatatatt 


ggcaaagtaa atgtttaaac 


cttttacgtt aaaattactt 


835 


tgaaagatga 


aaagttagtg 


ctgtttttgt cacgttatac 


tgaaattaaa tgtttataat 


895 


ttatattttg 


ggtttatgta 


taaatcatgg aatttatgca 


aaaatatgag tagtacagat 


955 


tctcctctaa 


ttctgtagga 


ctttgaataa t^tgatattt 


ttcttataat tggacccttg 


1015 


tgttttgaag 


aaatgccaac 


tgcttgaaga atctccttgt 


tatttgtatt atttgctata 


1075 


gggttagatg 


ttgagaaatt 


ctgctgacaa aaaattttaa 


gccagtttta cactaaatgt 


1135 


tcctcagtct 


gattaatttg 


ttattggatg tattctgtat 


ctttcttttg taatttgtaa 


1195 


cttttatcca 


cttagcacga 


atgattctat taaagaaaat 


cattaggaag tggtagaaac 


1255 


tttaaatcgc 


cccagagttt 


gcctgtttcc atattttatt 


atcttataat cttcgggagt 


1315 


gcttacactt 


atggagctaa 


cattttcaga gatacagctt 


cttatagtaa cactaaaact 


1375 


ttcttcctct 


ttggactgaa 


tacctataat tataactata 


tggtagttta agtttccttg 


1435 


tgattagtca 


aaaataccat 


tttagtabga agcaatgaag 


tctattattt gttgtcccat 


1495 


aattgagaaa 


gcttaaatac 


accttttatt aagagtttgt 


aaattctagc ttagtctaca 


1555 


cagattttta 


tatcaatttg 


tttatatttt tattaatgtc 


atttctggaa gtgtgaaaat 


1615 


gttaatgttc 


aacaagcaac 


attaaaaata gatttgaaac 


atttatatat agagaggtac 


1675 


acatttattt 


actgtttagg 


tactgaa^at tatcacttaa 


taaaaaatat atatccc 


1732 



<210> 12 
<211> 2913 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (511) . . (2913) 
<4O0> 12 

atgaatatgg acatcaatga ttggtacttc gctactcatc atggtgatat tagcggttct 60 
agacatatcc tggatttggg actactaaaa gcgcaaaaga aggcccagag aagggagcac 120 
atgctaaaac ttgaggctga gaagaaaaag cttcgaacta tacttcaagt tcagtatgta 180 
ttgcagaact tgacacagga gcacgtacaa aaagacttca aagggggttt gaatggtgca 240 
gtgtatttgc cttcaaaaga acttgactac ctcattaagt tttcaaaact gacctgccct 300 
gaaagaaatg aaagtctgag tgttgaagac cagatggagc agtcatcctt gtacttttgg 36 0 
gaccttttgg aaggtagtga gaaagcagtg gtaggaacga catacaaaca cttgaaggat 420 
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